Harmonic Analysis: 
A Gentle Introduction 


Carl L. DeVito, PhD 


University of Arizona 


JONES AND BARTLETT PUBLISHERS 
Sudbury, Massachusetts 
BOSTON TORONTO LONDON SINGAPORE 


World Headquarters 


Jones and Bartlett Jones and Bartlett Publishers § Jones and Bartlett Publishers 
Publishers Canada International 

40 Tall Pine Drive 6339 Ormindale Way Barb House, Barb Mews 

Sudbury, MA 01776 Mississauga, Ontario London W6 7PA 

978-443-5000 LSV 1J2 UK 

info@jbpub.com CANADA 

www.jbpub.com 


Jones and Bartlett's books and products are available through most bookstores and online 
booksellers. To contact Jones and Bartlett Publishers directly, call 800-832-0034, fax 978- 
443-8000, or visit our website www.jbpub.com. 


Substantial discounts on bulk quantities of Jones and Bartlett's publications are available 
to corporations, professional associations, and other qualified organizations. For details 


and specific discount information, contact the special sales department at Jones and 
Bartlett via the above contact information or send an email to specialsales@ jbpub.com. 


Copyright © 2007 by Jones and Bartlett Publishers, Inc. 


ISBN- 13: 978-0-7637-3893-8 
ISBN- 10: 0-7637-3893-X 


All rights reserved. No part of the material protected by this copyright may be reproduced or 
utilized in any form, electronic or mechanical, including photocopying, recording, or by any 
information storage and retricval system, without written permission from the copyright 
owner. 


Library of Congress Cataloging-in-Publication Data 


DeVito, Carl L. 
Harmonic analysis : a gentle introduction / Carl L. DeVito. 
p. cm. 
Includes bibliographical references and index. 
ISBN-13: 978-0-7637-3893-8 
ISBN-10: 0-7637-3893-X 
1. Harmonic analysis. I. Title. 
QA403.D457 2007 
515°.2433—dc22 
2006010535 


6048 


Production Credits 

Acquisitions Editor: Tim Anderson 
Production Director: Amy Rose 
Editorial Assistant: Laura Pagluica 
Production Editor: Tracey Chapman 
Production Assistant: Jamie Chase 
Manufacturing Buyer: Therese Connell 
Marketing Manager: Andrea DeFronzo 
Composition: Northeast Compositors 
Cover Design: Timothy Dziewit 

Cover Image: © Bob Wainsworth/ShutterStock, Inc. 
Printing and Binding: Malloy, Inc. 
Cover Printing: John Pow Co. 


Printed in the United States of America 
10 09 08 07 06 10098765432 1 


Contents 


Preface v 


Chapter 0 Preliminaries 1! 
1. Set Theory |! 
2. Relations and Functions 4 
3. The Real Number System 7 
4. The Complex Number System I! 
5S. Analysis 15 


Chapter 1. Classical Harmonic Analysis 23 

1. The Dirichlet Problem for a Disk 24 

2. Continuous Functions on the Unit Circle 25 

3. The Method of Fourier 29 

4. Uniform Convergence 32 
5. The Formulas of Euler 39 
6. Cesaro Convergence 43 
7. Fejér's Theorem 47 
8. At Last the Solution 52 


Chapter2 Extensions of the Classical Theory 57 
1. Functions on (-m%, 1%) 57 
2. Functions on Other Intervals 61 
3. Functions with Special Properties 65 
4. Pointwise Convergence of the Fourier Series 70 


iv Contents 


Chapter3 Fourier Series in Hilbert Space 79 

. Normed Vector Spaces 80 

. Convergence in Normed Spaces 84 

Inner Product Spaces 90 

. Infinite Orthonormal Sets, Hilbert Space 96 
. The Completion 103 

. Wavelets 107 


DAunhkwn — 


Chapter4 The Fourier Transform 113 

. The Fourier Transform on Z 113 
. Invertible Elements in €! (Z) 119 
The Fourier Transform on R= 123 
Naive Group Theory 128 

Not So Naive Group Theory 131 
Finite Fourier Transform 135 
An Application 143 

Some Algebraic Matters 150 
Prime Numbers 154 

Euler’s Phi Function 159 


POI DAH AWK 


=) 


Chapter5 Abstract Algebra 165 
I. Groups 165 
2. Morphisms 169 
3. Rings 172 
4. Fields 175 
Appendix A Linear Algebra 179 
Appendix B The Completion 181 
Appendix C Solutions to the Starred Problems 187 


Index 213 


Preface 


Many branches of mathematics come together in harmonic analysis, each adding 
richness to the subject and each giving insights into this fascinating field. My 
purpose in writing this book is to give the reader an appreciation for all this 
interaction without overwhelming him or her with technical details. I also hope 
to inspire the reader to learn more about harmonic analysis and about inathe- 
matics in general. 

This book can be read by any student who has had a course in advanced 
calculus and has some acquaintance with linear algebra. My students were 
mostly mathematics majors in their junior or senior year. Fourier transforms, 
the fast Fourier transform, even abstract algebra, have all made their way 
into modern engineering and into many areas of modern science. Some of my 
students were majoring in these subjects. For them, I usually asked that they 
knew some linear algebra and had previously taken an “Advanced Analysis for 
Engineers” type of course. Typically these students had trouble with the proofs, 
but, they told me, they had little interest in the proofs and were more concerned 
with understanding the definitions and doing the calculations. Because of the 
varied backgrounds of my students, I have tried to make the book as self- 
contained as possible. 

The text begins with a very elementary preliminary chapter (Chapter Zero). 
This contains the kind of material that every student has seen but many have 
forgotten. It is there mainly for reference so that a student who has forgotten 
some fact about complex numbers, or some theorem from calculus, can refresh 
his or her memory. The only thing that might be new to some readers is the 
derivation of the Cauchy-Ricmann equations. These are used to convince the 
student that Laplace’s equation has lots of particular solutions. We refer to 
this fact in Chapter One where we define harmonic functions. It also tells thie 
student why solving partial differential equations requires a new technique. 

Chapter One contains a discussion of the Dirichlet problem for a circle with 
continuous boundary data. Here harmonic functions and periodic functions 
are treated. We also touch on ordinary differential equations and the method 
of separation of variables for partial differential equations. There is a section 
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dealing with uniform convergence. This can, of course, be skipped or assigned 
as reading if the students are familiar with this topic. We use this material to 
derive the Euler formulas. The theorem of Fejér, that the Cesaro means of the 
Fourier series of a continuous function on the circle converge to that function 
uniformly over the circle, is proved in Section 7. We use that result to solve the 
Dirichlet problem and to prove the Weierstrass approximation theorem. 

The purpose of Chapter One is to give the students a thorough foundation 
in basic analysis as it relates to trigonometric series. Once this has been mas- 
tered. the student is ready to learn something about the apphcations of these 
series and about the more abstract approaches to their study. 

In Chapter Two we discuss the way a trigonometric series can be used 
to represent functions of arbitrary period, functions defined only on an inter- 
val, and functions with discontinuities. Lots of examples of functions and their 
Fourier series are given. Much of this material can be covered quickly, and 
it is not used in any subsequent section. There is, however, a theorem about 
pointwise convergence that will be new to most students. 

In Chapter Three we look at Fourier series in Hilbert space. Here a little 
linear algebra is needed, and a brief discussion of the material required is given 
in Appendix A. We discuss inner product spaces and normed spaces and their 
interrelation. Banach and Hilbert spaces are defined and their basic properties 
investigated. A proof that every normed space has a completion that is a Banach 
space is given in Appendix B. There we also prove that the completion of an 
inner product space is a Hilbert space. These proofs are lengthy and a bit 
tedious. which is why we put them in an appendix. Instructors can cover them 
or simply assign them as reading. 

In the remainder of Chapter Three, we define L?(7'), T being the unit circle, 
to be the completion of the continuous functions on T, C(T), with the integral 
inner product. We then prove that {e'"'//27} is an orthonormal basis for this 
Hilbert space. Once again the theorem of Fejér is used. We end Chapter Three 
with a brief discussion of wavelets. Here we define the Hilbert space L? (R) and 
show how certain functions give rise to orthonormal bases for this space. 

Chapter Four deals with the Fourier transform. We start with the vector 
space ¢!(Z): sequences {a(n)} such that 5° |a(n)}| < oo. We define the con- 
volution operation and the Fourier transform for elements of this space. The 
advantage in doing that is that all the proofs can be given, and in this case 
they are quite transparent. 

Next we look at elements of £1(Z) that are invertible with respect to con- 
volution. To show that there is a connection between these ideas and Fourier 
series, we discuss Wiener’s theorem: If f € CT) is never zero and the Fourier 
series of f is absolutely convergent, then the Fourier series of 1/f is absolutely 
convergent. This connection motivates a further investigation into the prop- 
erties of ¢'(Z) under convolution, and we end Section 2 with a sketch of the 
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proof of Wiener’s result. Although the proof is, necessarily, incomplete, I believe 
I have done enough to give the reader a pretty good idea of what is involved. 

In Section 3 we discuss the Fourier transform on R. Because we are not 
assuming a knowledge of the Lebesgue integral we pursue, as far as possible, the 
analogy with @!(Z). Many examples of functions and their Fourier transforms 
are presented, and we show, by means of an example, how the Fourier transform 
can be used to solve a differential equation. 

Reviewing the first three sections of this chapter we observe that the Fourier 
transform involves maps from Z into T and continuous maps from R into T, 
that “preserve the algebra” [i.e., maps c such that c(m +n) = e(m) - e(n)}. 
This, of course, takes us into group theory and the notion of a homomorphisin. 
We do not use this terminology, however, until Chapter Five. Here we define, 
very simply, subsets of C that are groups under addition or under multipli- 
cation. The discussion soon leads us to Z, (the integers modulo nr) and its 
additive structure. By investigating maps from Z,, into T that “preserve the 
algebra” we are led to the group of nth roots of unity and the finite Fourier 
transform. This transform is of central importance in certain branches of en- 
gineering (e.g., digital signal processing). We discuss its properties and show 
how it arises, rather mysteriously, in Lagrange’s treatment of equations. This 
is done in Section 7. The calculations are a little lengthy, but this material is 
not used in later sections and so it can be skipped or assigned as reading. 

Sections 8-10 contain some number theory. In Section 8, we show when a 
power of a primitive nth root of unity is again primitive, we discuss the multi- 
plicative structure of Z,, and characterize those elements that have inverses, and 
we define the Euler phi function. Some instructors may want to skip sections 9 
and 10 and go directly to Chapter Five. Finding a formula for the phi function 
requires us to discuss primes, and to keep things interesting, we mention per- 
fect numbers and their relation to Mersenne primes. At this point we have done 
enough modular arithmetic to prove the interesting half of the Lucas-Lehmer 
test, which tells us when 2? — 1 is a prime. We end Chapter Four with a brief 
discussion of how the phi function enters into the creation of cyphers. 

The final chapter of the book contains the basic definitions from abstract 
algebra. Here we define groups and point out the many examples of groups 
that have occurred at varions places in the text. We define homomorphisms and 
isomorphisms and explain that the maps froin Z into T, discussed in connection 
with the Fourier transform, are examples of these things. The Fourier transform 
itself is an isomorphism between rings, and so we define rings and fields and 
point out these facts. A short treatment of the properties of finite fields (these 
do arise in the applications of harmonic analysis) is also given. 

The detailed discussion of the Dirichlet problem given in Chapter One is 
intended as motivation for the study of trigonometric series. Instructors who 
wish to by-pass this material and begin their course with the series themselves 
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may easily do so. Simply cover Sections 4, 5, 6, and 7 of Chapter One and omit 
Section 8. The remainder of the text is then accessible. 

Those instructors who want to get to the Fourier Transform as rapidly as 
possible may do so by covering Sections 4~7 of Chapter One, then Sections 1 
and 2 of Chapter Three, and the material in Section 5 up to Theorem 4 (in 
particular, Definition 1 and Theorem 3). I have tried to get the reader actively 
involved in the development of the mathematics. To accomplish this, I put some 
results, results referred to later in the text, in the exercises. Any such exercise 
is marked with a star and should be worked, or at least read, by anyone using 
the book. Solutions to the starred problems are given in Appendix C. 

I received helpful suggestions about various parts of the book from a great 
many people. I would now like to thank Nirmal Bose (Pennsylvania State Uni- 
versity), James Kang (California State Polytechnic University), David Walnut 
(George Mason University), Selin Aviyente (Michigan State University), Kasso 
Okoudjou (Cornell University), Ilya Pollak (Purdue University), and John Brill- 
hart and William Farris (University of Arlizona). It is my pleasure to also thank 
my Belgian colleague Maurice Hasson for reading the entire manuscript and 
making many valuable suggestions. I would also like to thank Victoria Milne 
for typing the manuscript and having infinite patience during its many revi- 
sions. 


Chapter Zero 
Preliminaries 


All the mathematics needed to read this book is outlined in this chapter. Thie 
advantage in doing this is that the material is then easy for us to reference 
and, if necessary, easy for the reader to review. The notations and conventions 
used in the book are also explained here, but these are, for the most part, quite 
standard. 


1. Set Theory 


In ordinary English the word we use for a collection of objects depends on what 
those objects are. So a collection of cows is a herd, while a collection of sheep 
is a flock. A collection of owls is a parliament, while a collection of ravens is a 
murder. We even have a name for a collection of unicorns. It’s called a blessing. 
In the terminology of mathematics the word “set” is used for any collection of 
objects, whatever those objects may be. These collections, however. inust be 
well defined; i.e., it must be clear just which objects are in a given collection 
and which objects are not. The collection of all great novels, for example, is not 
a set because there is no gencral agreement, even among experts, as to what 
constitutes a “great” novel. Moreover, are we to include in our collection novels 
yet to be written or only those in print today? One faces similar difficulties 
with any collection of physical entities. The only “real” sets are collections 
of mathematical objects. Still, it is often illuminating to consider collections 
of physical entities and to call them sets provided one understands that this 
practice is really just a pedagogical device whose value is more psychological 
than mathematical. 

The objects comprising a set are called its “members” or its “elements.” 
Given a set X and an object xz we write x € X (read “x belongs to X”) when 
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r is a member of the set X, and we write z ¢ X when this is not the case. A 
set is completely determined by its elements. More formally: 


Definition 1. Two sets X and Y are said to be equal, written X = Y, if 
every element of X is an element of Y and every element of Y is an element of 


X. 


Sets that contain only a few objects can be specified by simply listing their 
members. To emphasize the fact that we want to consider the set, a new 
“entity” formed by the act of collecting these objects together, we list them 
between a pair of curly brackets. So {a,b,c} is the set consisting of the three 
letters a. b, and c from the English alphabet. This notation is impractical when 
we work with sets that contain a great many members. The next definition will 
help us deal with that situation. 


Definition 2. Let X,Y be two sets. We shall say that X is a subset of Y, 
and write X C Y, if every element of X is also an element of Y. 


When X C Y, we shall say that X is contained in or included in Y, and 
we shall say that Y contains, or includes, X. It is clear that X = Y if, and 
only if. we have both X C Y and Y C X. This observation is useful because 
in many problems the easiest way to show that two sets are equal is to show 
that each contains the other. (See Exercises 2, problem 1b.) 

It is often convenient to define a set by specifying that its members are all 
elenients of a known set, X say, and that they are distinguished from the other 
elements of X by the fact that they satisfy a certain condition. For example, if 
X is the set of all atoms, we may want to consider the subset of X defined as 
follows: {2 € X|z is a hydrogen atom}. The vertical bar is read “such that” 
and the total expression reads “the set of all x in X such that =z is a hydrogen 
atom.” When this notation is used, it is assumed that the condition specified 
is clear enough to define a unique subcollection of X. 


Definition 3. Given two sets X and Y we define two new sets as follows: 
(a) X U Y(read “X union Y”) is the set consisting of those objects that are 
members of X or of Y, and (b) X MY (read “X intersect Y”) is the set 
consisting of those objects that are members of X and Y. 


It is clear that X UY contains both X and Y and that X NY is contained 
in both X and Y. Moreover, because we use “or” in the inclusive sense, we 
alwavs have XN Y C X UY. The set with no members is called the empty set 
and is denoted by @. From Definition 2 we see that @ is a subset of any set X. 
This is consistent with Definition 3 because @= O@N X C X. 


Definition 4. Two sets X and Y are said to be disjoint if X NY = @. 
It is a curious fact that the empty set is contained in, yet disjoint from, every 
Set. 
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A set that contains a single element, say z, is called a singleton, or the 
singleton defined by z. We denote this set by {zx}. If y is another object 
different from zx, then the set {z,y} is called the unordered pair defined by z 
and y. This terminology reflects the fact that {z,y} = {y,z}. We shall need 
to consider the ordered pair defined by z and y. 


Definition 5. Given two objects x and y we define their ordered pair, (z, y), 
to be the set {{z}, {z, y}}. Given two sets X and Y, the set XxY = {(z,y)|z € 
X and y € Y} is called the Cartesian product of X and Y. 


We leave it to the reader to show that (z. y) # (y,z) unless we happen to 
have z = y (Exercises 1, problem 2). The Cartesian product of a set X with 
itself, X x X, is sometimes denoted by X?. 


Exercises 1 


A number of useful facts, facts that are referred to later in the text, are scattered 
among the exercises. In some of the exercises we introduce terminology that 
is also used later. Any problem that is referred to later in the text is marked 
with an asterisk. 


1. Show that © is not equal to {@} and that neither of these sets is equal 
to {2, {O}}. 
*2. Given two objects z and y we have defined (z, y) to be the set {{:r}, {z, y}}. 


(a) Show that (z, y) = (y,z) if, and only if, z = y. 
(b) Show that (z, y) = (u,v) if, and only if, z = u and y= v. 


This exercise justifies our calling (z,y) the ordered pair defined by z and 
y. 


+3. Let X,Y be two sets. We define X — Y to be {x € X|z ¢ Y}. Show that 
X-Y=X -(XNY). 


*4. Let E be a fixed set and let X be any subset of FE. The set E — X is 
called the complement of X (in £) and is denoted Cge(X). Let X,Y 
denote arbitrary subsets of E. 


(a) Prove DeMorgan’s theorem; i.e., show that 
i. Ce(X UY) = Ce(X)NCE(Y); 
ii. Ce(X NY) = Ce(X)UCE(Y). 
(b) Show that X -Y=XN[CeE(X NY)} = XNCE(Y). 
(c) Show that the following are equivalent: 
ij XUY=E 
ii. Ce(X) CY 
iii. Ce(Y) CX 
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(d) Show that the following are equivalent: 


i X CY 

li. Ce(X) > Ce(Y) 
iii, XUY =Y 

iv. XNY =X 


2. Relations and Functions 


We want to introduce some concepts, and the terminology associated with 
them, that will be useful throughout the book. To enhance the applicability of 
these ideas, we present them in a very general abstract setting. Some concrete 
examples, however, are given here and many more are given in subsequent 
sections. 

Throughout this section, X and Y denote nonempty sets and X x Y denotes 
their Cartesian product (Section 1, Definition 5). 


Definition 1. Any nonempty subset R of X x Y is called a relation from X 
to Y. We shall say that z € X and y € Y are R related, and we shall write 
rRy, if (x,y) € R. The two sets D(R) C X and R(R) C Y, called, respectively, 
the domain and the range of R, are defined as follows: D(R) = {z € X| there is 
ayé€Y with (z,y) € R}, R(R) = {y € Y| there is an z € X with (z,y) € R}. 


We can define a relation from the set M of all men to the set W of all 
women by taking R = {(m,w) € M x W| m and w are married (to each 
other)}. Then D(A) is the set of all married men and R(R) is the set of all 
married women. 


A relation from a set X to itself (i.e., a subset of X x X) is called a relation 
on X. A relation R on X is said to be (a) reflexive, if rRz for every x € X; 
(b) symmetric, if z,y in X and zRy implies yRz; (c) transitive, if for z, y, z in 
X., we have zRz whenever we have both rRy and yRz. 

A relation that has all three of these properties (i.e., a relation that is 
reflexive, symmetric, and transitive) is called an equivalence relation on X. 

Referring to the set W we can take w = w’ to mean that the two women w 
and u” are sisters. (Let’s agree that two, not necessarily different, women are 
sisters if they have the same parents.) It is easy to see that = is an equivalence 
relation on W. 


Definition 2. A relation R from a set X to a set Y is called a function from 
X to Y or a mapping from X into Y, if (a) the domain of R is X and (b) 
whenever (z, 3) and (z,y’) are both in R, we must have y = 7’. 


We can rephrase (b) as rRy and xRy’ together imply y = y’. A relation R 
that has property (b) but not (a) is sometimes called a partial function from 
X to Y. In this case, R is a function from X’ to Y where X' = D(R). 
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In a monogamous, or a polyandrous, society, the relation from M to W 
defined above is a partial function. In a polygamous society, it is not. When f 
is a partial function from X to Y, it is customary to write y = f(z) instead of 
zfy or (z,y) € f. We shall also use the syinbol f : X — Y to abbreviate the 
phrase “f is a function from X into Y.” Let us emphasize that when we write 
f :X —Y the domain of f is all of X. The set Y, in this case, is often called 
the codomain of f and the range of f is, of course, a subset of Y. 


Given f : X — Y we can, for each y € Y, define a subset pre,(y) of X as 
follows: pre,(y) = {z € X| f(x) = y}. We call this set the preimage of y. Note 
that because f is a function, the sets pre,;(y) and pre,(y’) are disjoint unless 
y = y'. Moreover, pre,(y) #@ if, and only if, y is in the range of f. There are 
two important special cases here. (i) When f(z) = f(z’) implies z = x’. In 
this case, every nonempty pre,(y) is a singleton. (ii) When the range of f is 
all of Y. In this case, no prey(y) is empty. 


Definition 3. A function f from X to Y is said to be one-to-one if f(z) = 
f(z’) implies x = z’. We shall say that f is onto or that f maps X onto Y if 
the range of f is Y. 


Observe that when f : X — Y is both one-to-one and onto we can define 
a function g : Y — X by setting g(y) = pre,(y) for every y € Y. 


A set whose elements are themselves sets is often called a family of sets. 
Given a family of sets A we can always “name” the members of J as follows: 
Choose a set 7 and a one-to-one, onto function y : ] — A. We call y an 
indexing of A and we call J the index set. It is customary to denote the set 
(A) € A by A) and to write A= {A)|A € 1}. We now define 


LJA= J Ar = {z|x € A) for some 4 € [} 
rAEl 


and 


()A= () A) = {z|z € A) for every 4 € I} 
Ael 


A family of sets A = {A)| A € /} is said to be pairwise disjoint if ANA, = 
@ whenever A, are in J and AF pu. 


Given a function f from X to Y, the set R(f) indexes the family of 
nonempty preimages of f; ie., {pre,(y)|y € R(f)}. This is a family of 
nonempty pairwise disjoint sets whose union is X. 


Axiom of Choice: When f : X — Y is onto it seems evident that there 
“should be” a function g : Y — X that is one-to-one; for each of the sets pre ,(y) 
is nonempty so we may set g(y) equal to any member of this set. This seems to 
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give us a function, and because these preimages are pairwise disjoint, g must be 
one-to-one. The trouble with this is that we have not specified which element 
of pres(y) is to be g(y); hence, we have not really defined anything here. There 
is, in fact, no way to specify g(y) in the general case. To define g, we must 
appeal to a famous axiom: Given any nonempty family of nonempty sets, there 
is a function that assigns to each set in the family of an element of that set. 
This is the axiom of choice. It has many applications and many equivalent 
forms and we discuss some of them later (see Section 2 of Chapter 4). 


Exercises 2 


*1. Let = be an equivalence relation on a set X. For each zr € X let (z] = 
{y € X|xz = y}. We call (z] the equivalence class defined by z. 


(a) Show that for each z € X, [x] #2. 
(b) Show that for z, y in X, [xz] NM [y] #2 if, and only if, (x] = [y}. 


Thus the equivalence classes of X are either identical or they are disjoint. 
*2. Let f : X — Y and let g be a partial function from Y to a set Z. 


(a) If the domain of g contains the range of f, show that setting (g 0 
f)(xz) = g[f(x)] for each x € X defines a function, go f, from X to 
Z. We call this the composition of f and g. 

(b) Show that f is both one-to-one and onto if, and only if, there is a 
function g : Y — X such that (f og)(y) = y for all y € Y and 
(g° f)(x) = x for each x € X. When such a g exists, we call it the 
inverse of f and denote it by f—!. 


+3. For any set X, let P(X) denote the family of all subsets of X. 


(a) Define a relation = on P(X) as follows: For Y,Z € P(X), Y = Z if, 
and only if, there is a function f from Z to Y that is both one-to-one 
and onto. Show that = is an equivalence relation on P(X). 


(b) Define a relation < on P(X) as follows: For Y,Z in P(X), Y < Z 
if, and only if, Y C Z. 
i. Show that the relation < is reflexive and transitive. 
ii. Show that < is antisymmetric, meaning Y < Z and Z < Y 
together imply Y = Z. 


A relation R on a set X that is reflexive, antisymmetric, 
and transitive is called a partial order relation on X or a partial 
ordering of X. The term “partial” refers to the fact that there 
may be elements of X that are incomparable, i.e., elements z, y 
of X such that neither rRy nor yz is true. A partial order 
relation on a set X is called a total order relation or a total 
ordering of X if any two elements of X are comparable. 
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4. Suppose that for some set X, we have a function g : X — P(X) that is 
onto. Then for each x € X, g(r) € P(X) (i-e., g(x) is a subset of X) 
and so it makes sense to ask whether or not z is an element of g(z). Now 
let Xo = {x € X|z ¢ g(x)}. This is a subset of X, hence an element of 
P(X). There is an zp € X such that 9g(z0) = Xo because g is onto. 


(a) Show that zo € g(zo) leads to a contradiction. 


(b) Show that ro ¢ g(zo) leads to a contradiction. 


We conclude that for any set X there is no function from X to 
P(X) that is onto. 


(c) The famous Russell paradox arises when we consider the “set” of all 
sets that are not members of tliemselves (e.g., the set of all books is 
not a book). If there were such a set, let’s call it S, then we have 
both S € S and S ¢ S. Paradoxes like this led to the creation of 
axiomatic set theory. 


#5. Let {A,|A € I} be a family of sets and suppose that there is a set X 
such that A, C X for every \ € J. Prove the strong form of DeMorgan’s 
theorems, i.e., show that 


(a) Cx [Une Aa] = Nae Cx(Aa) and 
(b) Cx [Nyer Aa] = Uner Cx(Aar) 


3. The Real Number System 


The real numbers were introduced into mathematics to provide a foundation 
for analysis (i.e., calculus and its off-shoots). We assume the reader has some 
familiarity with this set, but a brief review of its basic properties is well worth- 
while. 

To begin with, any two real numbers can be added and multiplied and 
the result is a real number. Moreover, this set, which we shall denote by R, 
contains two special members 0 and 1 that have the properties: 


(1) r+0=0+2=2 forallreR 
(2) r-l=1-r=2 forallreR 


These numbers, which are clearly unique (see Exercises 3, problem 1), 
are called, respectively, the additive and multiplicative identities of R. Each 
element z € R has a unique additive inverse, that is, there is a unique element 
(—z) € R such that 


(3) r+(-z) =(-z)+2r2=0 
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Thus we inay, for any two real numbers z and y, define z — y to mean 
r+(—y). 

Any nonzero real number z has a unique multiplicative inverse. This is the 
number, we shall denote it by «~!, for which 


(4) r-cob=27!.7r=1 


For any x,y in R with y # 0, we define z/y to mean z-y—'. We assume that 
the reader is familiar with the various properties of these arithmetic operations. 


There is a subset P of R that has the following properties: 


(5) For any z € R exactly one of the statements z € P, (-z) € P,x = 0, is 
true. 


(6) If c,y are in P so also are z + y and x: y. 


It is easy to see that the number 1 is an element of the set P, whereas 
the number 0 is not. The members of P are called the positive real numbers. 
Given r,y, in R we say that z is less than y or y is greater than z, and write 
xr < y, if y—z belongs to P. This gives us a relation on the set R that we shall 
call the order relation on R. It is clear from (5) that for any two distinct real 
numbers z and y we have z < y or y < z (but not both), and from (6) we see 
that z < y implies r+ z < y+ 2 for all z E R, and x-z < y-z for all z € P. 


Because 1 € P we see from (6) that 2 = 1+ 1 is in P and so is 3 = 2+ 1, 
etc. The subset of R generated in this way is denoted by N and is called the 
set of natural numbers. It has many fascinating properties but the one that we 
shall need in the sequel is this: 


(7) Any nonempty set of natural numbers contains a smallest member. 


The set R does not have property (7) because it has no smallest member. 
However, R does have an important property that can be stated in terms of 
the order relation. First, we need some terminology. Let S be a nonempty set 
of real numbers. A real number a such that a < s (read “a is less than or equal 
to s”) for all s € S is said to be a lower bound for S. Similarly, a number 6 € R 
is said to be an upper bound for S if s < 6 for all s € S. A nonempty subset of 
R is said to be a bounded set if it has both an upper and a lower bound. By 
convention the empty set, 2, is also called a bounded set. 


Definition 1. Let S be a nonempty set of real numbers. A real number dg is 
said to be the (see Exercises 3, problem 2c) least upper bound for S if: (i) bo 
is an upper bound for S, and (ii) if 6 is any upper bound for S, then bg < 6. 


We inay now state one of the key properties of the set R. 
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(8) Any nonempty set of real numbers that has an upper bound has a least 
upper bound. 


There are many ways to study the real numbers. In some treatments (8) 
is taken as an axiom and referred to as the axiom of continuity. In others it is 
a theorem and called the completeness property of R. Let us show how some 
of the most familiar facts abont R follow easily from (8). The results we have 
in mind are well known but, as we shall see in the sequel, extremely useful. 


Theorem 1. Let a.b be any two positive real numbers. Then there is a 
natural number n such that a < n- b. 


Proof. If a < b, we need only take n = 1. Suppose that nb < a for every 
natural number n. Then the set {nb|n € N}, which is certainly not einpty, has 
an upper bound, the number a, and hence, by (8), has a least upper bound 
that we shall denote by ag. Now for any n € N we have (n + 1)b < ao, which 
implies the inequality nb < ap — 6b for all n € N. Thus the number ag — 5 is also 
an upper bound for the set {nb|n € N}, and this contradicts the fact that ao is 
the least upper bound for this set. 


Corollary 1. Let a,b be any two positive real numbers. Then a = n-b+r 
where n is a natural number or zero, and r is a real number that satisfies the 
inequalities 0 < r < 6. 


Proof. Suppose b < a. Then {n € Nia < nb} is, by our theorem, nonempty 
and because it is a subset of N, it contains a smallest member that we shall 
call m; here we are using property (7). Thus we may write (m — 1)b < a < mb 
and because b < a we know that 2 < m. It follows that the number m - 1, 
that we shall denote by n, is a natural number. Thus, except when a = mb in 
which case r = 0, we have a = (m— 1)b+r = nb+r where r is the real number 
a —nb. Clearly, r < 6 in either case. 


Theorein 1 has another useful corollary. Before we can state it we must 
recall two important subsets of R. First, we have the set of integers, Z, obtained 
from N by adjoining zero and the additive inverse of each of its members. So 

(9) Z = {0, +1, +2,...} 
Next there is the set Q of rational numbers, 


(10) Q = {xz € R|z = m/n where m,n are in Z and n # 0} 


Corollary 2. Between any two distinct real nuinbers there is a rational nuin- 
ber. 
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Proof. Let a.b be two real numbers with a < 6. We want to find two integers 
m.n such that a < 2 < 6. 

Now the numbers 1 and b — a are positive, so there is, by the theorem, a 
natural number n such that 1 < n(b— a). Choose, and fix, such an n. Now, 
again by our theorem, the set {A € Z|na < k} is nonempty. Let m be its 
smallest member. Then na < m < na+1 < na+n(b— a) = nb, giving us 
a < = < 6 and proving the corollary. 


The set R together with its order relation is, as we have just seen, a so- 
phisticated mathematical object with an intricate structure. The members of 
R are lined up (linearly ordered) by this relation but not at all like the beads 
on a string. Between any two distinct real numbers there are always more real 
numbers. Thus given a specific number it is meaningless to speak of the “next” 
one. Moreover, removing a number does separate the line into two parts; it 
does leave a “gap,” but this gap has no edges. 


Exercises 3 
1. (a) Show that the additive identity of R is unique; i.e., if o and o’ both 
satisfy equation (1), show that o = 0’. 
(b) Show that the multiplicative identity of R is unique. 


2. (a) Given real numbers z and y with x < y, show that 
i r+z<y+z for any z€ R, and 
li. 2-2 <y-z for any z € P. 


(b) For z,y in R we have defined « < y to mean z < y or z = y. Show 
that < is a total ordering of R (Exercise 2, problem 3b). 


(c) Show that a set of real numbers that has a least upper bound has 
exactly one least upper bound. 


(d) Show that any nonempty set S C R that has a lower bound has a 
greatest lower bound; i.e., there is a number ag such that 


i. do is a lower bound for S, and 
ii. if a is any lower bound for S then a < ag. 


(e) Leibnitz wanted to add infinitesimals to the real number system; i.e., 
quantities 7 such that 0 < j < + for every n € N. Show that no 
meinber of R has this property. 


3. For any z € R define |z| to be z when z € P or x = O and define |z| to 
be (—x) otherwise. Show that for any x,y in R we have 
(a) [z+ yl < [2] + ly! 
(b) |lz| —Iyll < lz - y| 
(c) |jz- yl = lz] -|yl 
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4. Suppose that for some set S we have a function f : N — S that is onto. 
Show that there is a function g: S — N that is one-to-one. Explain why 
you do not need the axiom of choice in this case. 


5. (a) Show that there are real numbers that are not rational as follows: 
Consider the real number V2. If this were rational. there would be 
m 


integers m,n such that /2 = “. Clearly, we may suppose that m 
and n have no common factors. Now deduce a contradiction. 


(b) Let @ be a real number that is not rational; such nuinbers are called 
irrational. Show that 


i. a+ is irrational for every g € Q, and a-q is irrational for every 
nonzero element of Q. 


(c) Given a,b in R with a < 6, modify the proof of Corollary 2 to 
show that there are integers m,n for which a < (@) V2 < b. Thus, 
between any two distinct real numbers, there is an irrational number. 


6. For any two real numbers a and 6, let (a,b) = {z € R|a < x < b} and let 
[a,b] = {x € Ria < x < 5B}. 


(a) Under what conditions on a and 6 is the set (a,b) empty? What 
about the set [a, b]? 


(b) When (a,b) # @, we call this set an open interval. and when 
[a,b] 4 @ we call this set a closed interval. In either case, a and 
b are the end points of the interval and 6 — a is its length. We have 
seen that every open interval contains both rational and irrational 
numbers. Is this true of every closed interval? 


7. A purely fictitious sleep researcher discovers a man with a strange pattern 
of dreaming. On any closed interval of time during which he is asleep he 
dreams a great deal, but never on the (open) middle one-third of this 
interval. Suppose this man sleeps for 1 hour. Describe, as precisely as 
possible, the set on which his dreaming takes place. Hint: Model his nap 
by the interval (0, 1]. No dreaming takes place on the interval (4. 2). 


4. The Complex Number System 


The reawakening of Europe, which began in the Italian city states, is usually 
associated with the remarkable achievements in painting, sculpture. and archi- 
tecture. But it was an intellectual rebirth and so, not surprisingly, it had a 
mathematical side. Perhaps the two most important developments were the 
discovery of the laws of perspective by Brunelleschi, which led to the creation 
of projective geometry, and the discovery of the system of complex numbers. 
These numbers are of fundamental importance in modern physics and modern 
electrical engineering, but when they were first introduced they were viewed 
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with skepticism and distrust. Let us briefly review how they came into being 
and give a quick survey of their basic properties. 

One might believe that the study of the quadratic equation would lead to 
complex quantities. This, however, was not the case. Quadratics whose solution 
involved square roots of negative numbers were simply dismissed as having no 
solutions. It was in connection with the cubic equation that this could not be 
done. A formula for solving the general cubic was found by Scipione del Ferro 
and, independently, by Niccolo Fontana, also known as Ta’taglia. In 1545 this 
formula was published by Gerolomo Cardano in a book entitled Ars Magna 
wherein he clearly states that he obtained the formula from Ta’taglia and that 
the result was also found by del Ferro. For some reason the formula has become 
associated with Cardano’s name, leading many to believe he stole it. But his 
book is widely available and the facts can be easily checked (see Note 1). 


Now the equation z* = 15z + 4 was discussed by Bombelli in his algebra 
book written in 1572. Applying Ta’taglia’s formula, he obtained the root 


r= 2+ V=121 + 2- V=121 (1) 


Bombelli wondered if, perhaps, the cube roots appearing in (1) might be 
written in the form p+ /—q and p — //—q, what we today would call complex 
conjugates (see Exercises 4, problem 1). He was able to show that they are 
indeed 2+ /—1 and 2—/—1 (see Exercises 4, problem 10). Putting these in (1) 
he obtained r = 4, which is easily seen to satisfy the equation. Now expressions 
involving square roots of negative numbers had, on occasion, been considered 
before. In fact Cardano, in his book, used them to find two numbers whose 
sum is 10 and product is 40. But Bombelli went on to develop the arithmetic 
of these expressions, giving us the rules we use today. In modern notation he 
worked with quantities of the form a + bi where a,6 are real and i? = —1 and 
he developed the arithmetic operations on these quantities. Thus the system 
of complex numbers was born. 


A major advance in our understanding of equations came in the 1620s 
when the English mathematician Thomas Harriot began writing them in the 
modern way, i.e., ay a polynomial set equal to zero. So z* = 15z + 4 became 
z* — 15x —4 = 0. Harriot noted that whenever the number r was a root of the 
equation p(z) = 0, he could find a polynomial q(z) such that p(z) = (x—r)q(z). 
Clearly, g(z) must have degree one legs than that of p(x), and thus he discovered 
the fundamental fact that the degree of p(z) tells us the number of roots of 
the equation p(x) = 0. By combining this with the work done in Italy, it 
became clear that every quadratic, every cubic, and every quartic (fourth- 
degree equations, first solved by Lodovico Ferrari, a student of Cardano’s) had 
two, three, and four, respectively, complex solutions. The natural question 
then was whether one could show that every polynomial equation of degree n 
had n complex roots. Various people did make progress with this question, but 
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complete success did not come until nearly 200 years later (1820) with the work 
of Gauss. Thus we see that complex numbers, despite their mysterious nature, 
became central to the theory of equations. 

Still people were skeptical. Euler, who gave us the definition (generalizing 
an important observation of DeMoivre; see Exercises 4, problem 3b) 


e” =cosy+isiny, yER (2) 


wrote in 1770: “All such expressions as /—1, /—2, etc., are completely impos- 
sible or imaginary numbers, since they represent roots of negative quantities; 
and of such numbers we may truly assert that they are neither nothing, nor 
greater than nothing, nor less than nothing, which necessarily constitutes them 
imaginary or impossible.” 


The first clue as to how one might define complex numbers rigorously came 
from a geometric interpretation of these quantities given, independently, by 
a Norwegian surveyor named Wessel, who presented his ideas to the Danish 
Academy of Sciences in 1797; by the astronomer Bessel in 1799; and by a Swiss 
bookkeeper named Robert Argand in 1806. These results made the connection 
between complex numbers and points in the plane, i.e., ordered pairs of real 
numbers. Essentially what they did was to identify the symbol a + bi with the 
point (a,b) or, if one preferred, with the segment joining (0,0) to (a,b). We 
shall exploit this representation below. 


Let us turn now to Gauss’ definition of the system of complex numbers 
first given in 1831. He defined his basic set, C, to be the Cartesian product 
of R with itself, the set R x R. Next he needed arithmetic operations on the 
members of C and these he defined as follows: 


(a,b) + (c,d) = (a+c,b+d) (3) 
(a, b) - (c,d) = (ac — bd, ad + bc) (4) 


for all (a,b) € C and all (c,d) € C. 

It is easy to see that (0,0) and (1,0) are, respectively, the additive and 
multiplicative identities of C. Each complex number (a, b) has a unique additive 
inverse, (—a, —b) of course, and if it is not zero, a unique multiplicative inverse 
(see Exercises 4, problem 1b). It is customary to identify R with the subset. 
{(x,y) € C|y = 0}; thus every real number is also a complex number. Note that 
the real number —1 is now identified with (—1,0). Since (0, 1)-(0, 1) = (-1.0), 
by (4), (0,1) is the mysterious 7. 


Now that we know what complex numbers really “are” we shall, as is 
customary, write them as a + bi,a and b real, i? = —1. Thus C = {a + bila,b ee 
R}. We may identify any z = a+ bi in C with the line segment joining (0,0) to 
(a,b). The length of this segment, Va? + b?, is called the modulus, or absolute 
value, of z; we shall write |z| = Va? + b. It is clear from the geometry that we 
may write a = |z|cos8,b = |z|sin@ where @ is the angle between our segment 
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and the positive z axis. The angle @ is referred to as an argument of z; obviously 
@ is not unique. Thus, for any z = a + bi in C, we have 


z=a+bi = |z|(cos@ + isin) = |zle? (5) 


These last two expressions are called the polar forms of z. For any two points 
of C, say z} = 2, + ty and z2 = r2 + iyo, we define the distance between 
these points to be |z,; — z2|. This is just (x) — 22)? + (y: — y2)* hence for 
many problems we may identify the complex plane with the plane of analytic 
geometry. 


Exercises 4 


1. For any coimplex number z = a + bi define Z, read “z bar,” to be a — bi. 
This is called the conjugate of z. 


*(a) Show that z-% = a? + &? = |2|?. 
+(b) If z 4 0, compute the multiplicative inverse of z,z~!, and show that 
it is unique. 

(c) Define f: C — C by setting f(z) = Z for all z. Show that (i) 
f(z-w) = f(z): f(w); (ii) f(z+w) = f(z)+F(w); (ili) f(az) = af(z) 
when a is real. 

(d) Use (c) to show that the complex roots of a polynomial equation 
having real coefficients must occur in conjugate pairs; i.e., if p(z) = 0, 
then we must also have p(Z) = 0. 


2. Show that e'™ + 1 = 0. This formula was once thonght to have mystical 
significance because it combines the e of analysis, the 7 of geometry, the 
zero and one of arithmetic, and the 7 of algebra. 


*3. Let z and w be two complex numbers whose polar forms are, respectively, 
z= 7r(cos@ + isin 6), w = s(cos@ + ising). 


(a) Show that z- w = rs(cos(@ + ¢) + tsin(@ + ¢)). 
(b) Prove DeMoivre’s formula: 
For every natural number n, 2" = r"(cosn6 + isin n@). 


Hint: Use the fact that any nonempty set of natural numbers has a 
smallest member. 


*4,. Let z,w be any two complex numbers. Show that 
(a) |z-w|=|z|-}w| and = (b) [z+ w] < |z| + |w}. 


*5. Assume the fundamental theorem of algebra: Any polynomial equation, 
with complex coefficients, has a complex root. Use this to show that an 
equation of degree n has n complex roots. 
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«6. Given any function f from R to C show that there are two functions g 
and h, both from R to R, such that f(z) = g(x) + th(z) for all z € R. 


Hint: Define u: C — R and v: C — R by setting, for each z = ai + 
ib,u(z) = a and v(z) = 6b. Incidently, a is called the real part of the 
complex number a + bi, and 0 is called its imaginary part. This custom 
is unfortunate because it forces us to live with the peculiar fact that the 
imaginary part of a complex number is real. 


7. We refer here to section 3, specifically to the subset P of R with properties 
(5) and (6). Show that there is no subset of C that has these properties. 


8. Describe geometrically the sets defined by the following inequalities: 
(a) |z| < 1; (b) Jz +i] < 45 (c) [2] 2 2; (d) lz] =1. 


9. Find the real part, the imaginary part (see Exercises 4, problem 6), and 
the absolute value of (a) (3i — 2)*: (b) $4; (c) sy. 


10. Compute (2 + i)? and (2 — i)*. 


11. Show that for any natural number n the number i” is equal to one, and 


of course only one, of the numbers 7°, 7!, i, i3. 


5. Analysis 


The basic notions of analysis, limit. continuity, and differentiability are formally 
the same whether our functions map R to R,R to C,C to R, or C to C. To 
exploit this fact. we shall let K and K’ denote either R or C. 


Definition 1. Let so € K and let p be a fixed, positive, real number. The 
set U,(so) = {s € K]||s — so| < p} is called an open neighborhood of so or, 


sometimes, just a neighborhood of so. If S C K and s9 € S we shall say that 
So is an interior point of S if for some p > 0,U,(s0) C S. The set S is called 
an open set if each of its points is an interior point. 


We leave it to the reader to show that ¢ and K are open sets and that (i) 
the union of any family of open sets is an open set, and (ii) the intersection of 
any finite family of open sets is an open set. 

A subset of K is called a closed set if its complement is an open set. From (i) 
and (ii) and the theorem of DeMorgan (Exercises 2, problem 5) we immediately 
obtain (iii) the intersection of any family of closed sets is a closed set. and (iv) 
the union of any finite family of closed sets is a closed set. 


A nonempty subset S of K is said to be bounded if there is a positive real 
number M such that |s| < M for all s € S. The empty set is also taken to be a 
bounded set. We recall that any set that is both closed and bounded is called 
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a compact set. These behave very much like finite sets. In particular, when S 
is a compact set, we can, from any family of open sets whose union contains S, 
extract a finite subfamily whose union contains S. 


If a function f is defined on U,(sq), except possibly at so itself, and if for 
soine ty € K we have: Given ¢ > 0 there is a 6 > 0 such that |f(s) — to] < € 
whenever 0 # |s — so| < 6, then we shall say that to is the limit of f(s) as s 
approaches $9, and we shall write jim f(s) = to. 


Definition 2. Let f: S — K’ and let so be an interior point of S. We shall 
say that f is continuous at so if lim f(s) = f(s9). We shall say that f is 
3-30 


differentiable at so if Jim F(s) — f(s0) 


exists. This limit is denoted by f’(so). 
—30 8 — 89 


It is easy to see (see Exercises 5, problem 2) that if f is differentiable at so, 
it is continuous there. We stress that differentiability is defined only at interior 
points of the domain S of f. The following amusing example shows that this 
restriction is not to be taken lightly. 


Ezample 1 
One can easily show that for any natural nuinber n, 


(x) nPF=ntnt---+n 


where there are n summands on the right-hand side of (*). Now the derivative 
of n? is 2n and that of n is 1, hence 


(xx) 2n=14+1+4+---+¢l=n 


Dividing by n, which is valid because n is a natural number and hence not zero, 
we gett 2=1.8 


Let us specialize now to partial functions from R to R. More specifically, 
we consider a function f whose domain is a closed interval [a,b]. We already 
know what it means to say that f is continuous at any point zo € (a,b). We 
shall say that f is continuous at z = a if given € > 0 there is a 6 > 0 such that 
\f(a) — f(x)| < € whenever z € [a,b] and z —a < 6. We shall say that f is 
continuous at z = 6 if given € > 0 there is a 6 > 0 such that | f(b) — f(x)| <e 
whenever z € [a,b] and b—z < 6. Not surprisingly, we say that f is continuous 
on [a,b] when it is continuous at each point of (a,b) and also continuous at 
xz =a and at z = b. Let us now review some of the basic properties of real- 
valued functions that are continuous on closed intervals (remember a, b are in 
R so the length of [a, }] is finite). We shall omit the proofs. 


Theorem 1. There are points x), 22 of [a,b] such that f(x,) < f(z) < f(z) 
for all x in {a, 6]. 
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This theorem simply states that f has a maximum and a minimum on 
(a, b]. Suppose now that y is any number that is between the two extremes of 
f. Then: 


Theorem 2. There is a point zo € [a,b] such that f(zo) = y. 


Definition 3. Let f: [a,b] — K’. We shall say that f is uniformly continuous 
on [a, }] if, given € > 0, there is a 6 > 0 such that |f(z,) — f(z2)| < € whenever 
Z,,Z2 are in (a, b] and |r, — z2| < 6. 


Theorem 3. If f: [a,b] — R is continuous on [a,b], then it is uniformly con- 
tinuous on this interval. 


Theorem 4. If f: [a,b] — R is continuous on [a, 6], then the Riemann in- 
tegral of f, f. f(z)dzx exists. Moreover, there is a point zo € [a,b] such that 


f° f(z)dz = f(x0)(b - a). 


The second statement is an immediate consequence of theorems 1 and 2. 
Finally, we have the mean-value theorem: 


Theorem 5. If f: [a,b] — R is continuous and has a derivative at each point 
of (a,b), then to any A such that a < a+h < 6 there corresponds a number 
6,0 <6@< 1 such that f(a+h) — f(a) = hf'(a+ Oh). 


Let us turn now to functions from C to C. Simple examples are as follows: 
f(z) = 2° = (x + iy)? = (x? — y’) + i(2zy) (1) 
f(z) =e? = e7* = e* (cosy + isiny) = e* cosy + i(e” siny) (2) 


It is clear, as we have illustrated, that any f: C — C can be written in the 
form 


f(z) = u(z,y) + iv(z, y) (3) 


where u and v are real-valued functions defined on R?. Suppose now that f is 
differentiable at z9 = 29 + iyo. Then f is defined on some neighborhood U,( zo) 
and 

; Zz) —- Jie ’ 


s—* 20 er ae 


We must obtain the same number for this limit no matter how z approaches Zp. 
Let us take, first, z = (ro + Az) + iyo, where Az is so small that z € U,(z0). 
Then we find that 


u(Zo + Az, yo) = u(Zo, yo) £4 Jim v(Zo ag Az, yo) = v(Zo0, yo) 


f (20) = Ree Ar Az—0 Ar 


= 2 (29,40) + im (0. 80) 
a 0, Yo Or To, Yo 
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But by taking z = ro + i(yo + Ay) we obtain, similarly. 


oe -1> (co, wo) rt (to, wo) (4) 


Thus at any point at which f has a derivative, we have 


Ou Ov 

dz Oy (5) 
and 

Ou Ov 

ade (6) 


These are the Cauchy-Riemann equations named for the Frenchman A. L. 
Cauchy and the German G. F. B. Riemann, two pioneers in complex function 
theory. 

It is clear that u(z,y) = x? — y?, v(x. y) = 2zxy satisfy these equations as 
do u(x, y) = e* cos y and v(z, y) = e7 siny. 


Suppose now that we have two real-valued functions u and v defined on 
R? that satisfy the Cauchy-Riemann equations at some point. Suppose further 
that u and v have continuous first and second partial derivatives at this point. 
Then equations (5) and (6) give: 


Oru C7 u 

Ox? ~ Oxdy (7) 
and 

O7u O*v 

Oy? — Bydex ss 


2, 
Under our assumptions, Baby = oe. hence u must satisfy 


Pu Pu ‘ ‘ 
5) x2 Bg Oy? rs ( ) 
A similar calculation shows that v must satisfy the same equation. Equation 
(9) is called Laplace’s equation or, sometimes, the potential equation. We have 
already seen four solutions to this equation, and our discussion above shows 
how we can generate many more (see Exercises 5, problem 4). This is a very 
important observation as we shall see. 


A function g: N — S (recall that N = {1,2,3,...}), where S is an arbitrary 
set, is called a sequence of points of S, or, more simply, a sequence in S. It is 
customary to set g(m) = $, for all n and to speak of the sequence {3,}%,. 
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Definition 4. Let {s,}&, be a sequence in K. We shall say that {s,} is 


convergent to the element so € K, and we shall write lim s, = so, if given 
n-—-dco 


€ > 0 there is a natural number no such that |s9 — syj| < € whenever n > ro. 
A sequence that is not convergent is said to be divergent. 


When f is a partial function from K to K’ and {s,} is a sequence in the 
domain of f, then {f(s,)} is a sequence in the range of f. We can use the 
behavior of sequences constructed in this way to characterize those points at 
which f is continuous. 


Theorem 6. Let f be a partial function from K to K’ and let s9 be an 
interior point of the domain of f. Then f is continuous at so if, and only 
if, lim f(sn) = f(so) whenever {s,,} is a sequence in the domain of f that 
converges to So. 


Proof. Suppose, first, that f is continuous at so and that {s,,} is a sequence 
in the domain of f that converges to so. Given € > 0, we use the continuity of 
f to find 6 > 0 such that |f(so) — f(s)| < € whenever |s — s9| < 6; remember so 
is an interior point of the domain of f. Next we use the fact that tim Sn = 80 


to find a natural number no such that |s, — s9| < 6 whenever n > no. Thus for 
the given € > 0, we have |f(s,) — f(so)| < € whenever n > no, and this means 
{f(sn)} is convergent to f(so) as claimed. 

Now suppose that lim f(so) = f(s9) whenever {s,,} converges to so and yet 
f is not continuous at this point. We shall deduce a contradiction. Negating 
the definition of continuity at s9, we see that for some € > 0 we can, for every 
5 > 0, find an element s,5 in the domain of f for which |ss — so| < 4 and yet 
|f(ss) — f(so)| > €. We now construct a sequence as follows. Setting 6 = 1 
we find s, such that |s; — sol < 1 and |f(s:) — f(so)| > €. Next we set 6 = 3 
and find s2 so that |s2 — so| < 5 and |f(s2) — f(so)| > €. We continue in this 
way. After s;,52,...,5n—1 have been found, we set 6 = + and find s, so that 
Isn — So| < + and |f(sn) — f(so)| > €. It is clear that we have constructed a 
sequence {s,,} that converges to so. But it is also clear that { f(s,)} could not 
converge to f(so) because |f(s,) — f(so)| > € for all natural numbers n. This 
is our contradiction. 


Definition 5. A sequence {s,,} C K is called a Cauchy sequence if, given any 
€ > 0, there is a natural number ng such that |s, — s,,| < € whenever m and n 
both exceed no. 


If {s,,} converges to so, then because |sn — 8m| < |8n — 30| + |S0 — Sm| we 
see that this sequence inust be a Cauchy sequence. For the set R, it is in fact 
the case that a sequence converges if, and only if, it is a Cauchy sequence. This 
statement is equivalent to the completeness axiom for R (Section 3, paragraph 
just after equation (8) ). It follows easily that a sequence in C is convergent if, 
and only if, it is a Cauchy sequence (see Exercises 5, problem 1). 
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Exercises 5 


1. Let {z,,} be a sequence in C, and for each n let 2, = Tp + 1yn, where 2, 
and y,, are in R. 


(a) Show that {z,} is a Cauchy sequence if, and only if, both {z,} and 
{yn} are Cauchy sequences. 
(b) Show that lim Zn = Zo + iyo if, and only if, limz, = zo and 


lim Yn = Yo- 


«2. If the function f is differentiable at a point 89 € K, show that it is 
continuous at this point. 


«3. Suppose f: R — C. Then we may write f(z) = g(x) + ih(z), where g 
and A map R to R. 


(a) Show that lim f(x) = uo + ivo if, and only if, we have both 
r—2Io 


lim g(z) = ug and lim A(z) = wo. 
r—2rg r—2r9 


(b) Show that f is continuous on [a,}| if, and only if, both g and h 
are continuous on this interval. So f f(x)dz can be defined to be 
f° g(z)dx + if? h(z)dz, and we shall do so. 

(c) Show that f is differentiable at a point Zo if, and only if, both g and 


h are differentiable at this point. Show that, when this is the case, 
f'(z0) = 9' (Zo) + th"(z0). 


4. (a) The function f(z) = 2° is differentiable at every point in C. Find 
the real and imaginary parts of this function and show that together 
they satisfy the Cauchy-Riemann equations. Show that each of these 
functions satisfies Laplace’s equations. 


(b) Do the real and imaginary parts of the function f(z) = Z satisfy the 
Cauchy-Riemann equations? 


5. Let f be a partial function from R — K’ with domain a closed interval 
[a,b]. Show that f is continuous on [a, 6] if, and only if, we have (i) f 
is continuous on (a. b); (ii) lim f(zn) = f(a) whenever {z,} ¢ (a, 6] and 
lim 2, = a; (iii) lim f(2,) = f(b) whenever {z,,} C [a, 6] and limz, = 6. 


6. Let f be a partial function from K to K’ that is uniformly continuous 
(Definition 3) on its domain. 
(a) Show that {f(s,)} is a Cauchy sequence whenever {s,} € D(f) is 
a Cauchy sequence. 
(b) Show that f(x) = 1, D(f) = (0,1), is not uniformly continuous. 
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#7. Let {nx}?2, be a sequence of natural numbers such that ny < nx+, for 
all k. Given a sequence {s,,}, the sequence {s,, } is called a subsequence 
of {s,}. 


(a) Show that all subsequences of a convergent sequence are convergent 
to the same limit. 


(b) Show that any subsequence of a Cauchy sequence is a Cauchy se- 
quence. 


(c) If {s,} is a Cauchy sequence and if it has a subsequence that con- 
verges, show that {s,,} is convergent. 


(d) If {s,} is a Cauchy sequence, show that there is a positive real 
number M such that |s,,| < M all n. 


Notes 


1. Cardano’s book, Ars Magna, published in 1545, has been translated by T. 
Richard Witmer (The Great Art. M.I.T. Press, Cambridge, MA, 1968). 
On page 8 Cardano states that the formula for solving a cubic was found 
by Scipione del Ferro and, independently, by Ta’taglia. He also states 
that the formula was given to him by Ta’taglia. Nowhere does he claim 
it as his own. 


2. Bombelli’s work, and a comment from him on how he was led to these 
ideas, can be found in Number—the Language of Science, by Tobias 
Dantzig (Doubleday and Co., Inc., Garden City, NY, 4th edition, 1956). 
He discusses Bombelli’s derivation of the laws of complex arithmetic 
and also the work of Harriot (see pp. 184-186 and p. 188). Euler’s quote 
about the impossibility of taking square roots of negative numbers can 
be found on page 191. The geometric interpretation of complex numbers 
by Wessel and later by Argand is discussed on page 101. 


3. Ralph P. Boas, in his wonderful book, Invitation to Complex Analysis 
(Random House, Inc., New York, 1987), states that the geometric inter- 
pretation of complex numbers was also found by Bessel in 1799 (see the 
notes on p. 13). 


Chapter One 


Classical Harmonic 
Analysis 


The central question in this subject first arose in the 18th century when 
D’Alembert, Euler, and Bernoulli investigated the problem of the vibrating 
string. We imagine an elastic string, of finite length, stretched along the x-axis 
with both end points fixed. If the string is displaced to form a curve, say f(z), 
and then released, the y-coordinate of each of its points will vibrate (i.e., vary 
with time), and because the end points are fixed, y depends on z. So we’ll have 


a y(z, t) (1) 


with y(z,0) = f(z) and f(z) is known. It was soon discovered that the function 
y(z,t) could be found provided one could represent f(z) as a series of sines and 
cosines, i.e., as a trigonometric series. More explicitly, provided one could find 
constants do, @),@2,--- and 6,,62,--- such that 


f(z) = 500 + 5 7(an coana + 6, sinnz) (2) 


n=1 


The details of the analysis connecting the physical problem and the series 
representation are discussed later. 

Many writers begin their discussions of classical harmonic analysis with 
the series (2). We believe, however, that it is more instructive to stay closer to 
the historical development. So we shall begin with a problem of inathematical 
physics. A problem whose solution is, we believe, especially illuminating. 
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1. The Dirichlet Problem for a Disk 


Before we can state the problem we have in mind we need some terminology. 


Definition 1. Let C be a circle in the plane (i.e., R*) and let G be the set of 
all points inside C’. A real-valued function u defined in G is said to be harmonic 
in G if (i) u has continuous partial derivatives of order one and of order two at 
every point of this set. and (ii) at each point of our set u satisfies the equation 


gt + Ee =0 


The equation just written is called the potential equation or, sometimes, 
Laplace's equation. In Chapter Zero (Section 5) we gave examples of harmonic 
functions and indicated how one can find many others. These functions have 
some remarkable properties. 


Theorem 1. Let C be a circle of diameter d in R? and let G be the set of all 
points inside C’. Suppose that u is a real-valued function that is defined and 
continuous on G UC and is harmonic in G. Then the values of u in G cannot 
exceed the inaximum of u on C, nor can they be less than the minimum of u 
on C’. 


Proof. Let m be the inaximum of u on C, and suppose that at some point 
(to, yo) € G, u(zo, yo) = M > m. We may suppose that M is the maximum 
of u on GUC. Now translate the origin to (zo, yo). We obtain a function we 
shall denote by u that has all of the properties of our original function and has 
maximum m over C' and maximum M over GUC but attains this last value 
at (0,0). Now define an auxiliary function v(z, y) = u(z, v) + Mf sag (x? + y?). 
Clearly, v(0,0) = M and, if (z,y) € C, then v(z, y) < m+ (4 Mom\ Mim < 
M. 

Thus v attains its maximum over GUC’ at some point of G. But, at every 
point of G 


a a M—m _oM-m 
Oz? By? Ox? Oy? d? dt 


This is a contradiction because, at the maximum of uv, none of its second partials 
can be positive. 

To prove the second statement of the theorem we apply the result just 
obtained to the function —u(z, y). This proof is due to the Russian mathemati- 
cian I. I. Privalov. 


>0 


The problem we shall investigate here, and in the remainder of this chapter, 
is this: Let C be a circle in R? and let G be the set of all points inside C. Suppose 
that we are given a real-valued function f that is defined and continuous on 
C’. Can we find a function u that is harmonic in G, continuous on GUC, and 
equal to f on C” 
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This is called the Dirichlet problem (or, simply, the DP) for C’ with bound- 
ary data f. Theorem 1 has an immediate and, as we shall see, very important, 
consequence. 


Theorem 2. Solutions to the Dirichlet problem for a circle are unique. 


Proof. Using the notation introduced above, suppose that u,; and u2 are two 
solutions to the DP for the disk G U C with boundary data f. Then the 
function u); — ug is harmonic in G, continuous on G U C, and identically zero 
on C. By Theorem | this function must be identically zero at all points of G. 
Thus u,; = ug on GUC. 


Exercises 1 


«1. We shall solve the DP for the unit disk, i.e., the disk whose boundary is 
the circle of radius one centered at the origin. To do that it is convenient 


to use polar coordinates: x = rcos@, y = rsin@. Show that under this 

Oru Oru _ ob Ou 1 Ou 1 u 

dz2 * Oy? ecomes 3 + By. 72 OR 

2. Let C be a circle in the plane and let G be the set of all points inside 
C’. Suppose that f,. fo are two real-valued functions that are defined and 
continuous at eacli point of C and that these functions differ by less than 
some € > 0 at each such point. If uw; and ug are solutions to the (DP) for 
GUC with boundary data f, and fo, respectively, show that u; and u2 
differ by less than ¢ at each point of G. 


transformation, 


2. Continuous Functions on the Unit Circle 


The boundary data for the DP are continuous functions on a circle. Such func- 
tions have some interesting properties that we shall explore here. Recall that 
K denotes either the set of real numbers, R, or the set of complex numbers, C. 


Definition 1. For any function h: R — K, the set P(h) = {a € Rjh(z+a) = 
h(z) for all z} is called the set of periods of h. We shall say that h is a periodic 


function when this set contains a nonzero number. 


For any function h the set P(h) contains the number zero, and in certain 
cases it contains nothing else. For example, when A(x) = z or h(z) = e7, 
we have P(h) = {0}. At the other extreme, we have P(h) = R when h(z) is 
a constant; so constant functions are periodic. Less trivial examples are the 
functions sinz and cosz. We recall that 


P(sinz) = P(cosz) = {2nz7|neé Z} 


The set of periods of a function has some structure. 
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Lemma 1. For any function h: R — K we have (i) the number (—a) € P(h) 
whenever a € P(h), and (ii) if a.f are in P(h), so are both the numbers 
at¢tBa-—JZ. 


Proof. When a € P(h) we have h(x+a) = h(z) for all z. Hence, when z = y—a 
we have h(y) = h(y — a), and this is true for any y. 

Given a, 3 in P(h) we see that h[z+(a+)] = h[(z+a)+ 8] = h(r+a) = 
h(x) for all :r, first because 6 € P(h) and then because a does. The fact that 
(a — 3) is in P(h) follows from what we have just shown. 


Corollary 1. Ifa € P(h), then {na |ne€ Z} C P(h). When a is the smallest 
positive member of P(h), these two sets coincide. 


Proof. To show that P(h) D> {na | a € Z}, it is sufficient to show that 
P(h) D {na | n € N}. If this last statement is false, then {n € N | na ¢ P(h)} 
is nonempty, and hence it contains a smallest inember ng. From lemma | we 
see that no > 2, and so (mg — 1) € N. Furthermore, (no ~ 1)a must be in P(h). 
But then, again using lemma 1, (mg — 1)a + a = nga is in P(h), which is a 
contradiction. 

Now suppose that P(A) has a smallest positive element a. We want to show 
that P(h) = {na | n € Z}. Given any 8 € P(h) suppose, first, that 8 > 0. 
Then a < # and we can find an integer k such that 2 = ka+r, O< r <a. But 
both 3 and ka are in P(h). hence r € P(h), and so this number must be zero. 
Thus 2 = ka, k € N. When @ is negative, then (—3) € P(h), and this number 
is positive. So, (—3) = ka for k €N, telling us that {3 = (—k)a. 


A constant function is, obviously, a periodic function with no smallest pos- 
itive period. A less trivial example is given in Exercises 2 (problem 2). 

When a periodic function h has a smallest positive period, this number is 
sometiines called the period of h, or the Period of h. We shall call every element 
of P(h) a period of h, and when we need to talk about the smallest positive 
member of P(h) we shall refer to it as such. 


Lemma 2. If a continuous function on R has arbitrarily small periods, it is a 
constant. 


Proof. We shall prove this result for real-valued functions and leave the case 

of complex valued functions for the exercises (see Exercises 2, problein 6d). 
So let f : R — R be a continuous function and let {a,,} C P(f) be positive 

numbers such that tim Q@,, = 0. Now for each fixed n and any real number z, 
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there is, by the mean-value theorem for integrals (Chapter Zero), a real number 
z* such that 


zrt+an 
/ f(t)dt = f(2%)[(2 + on) — 2] = f(et)an 


and z < 27, < X+ any. Thus 


1 zrt+an 


=< f(t)dt = f(z,) 


an Jr 
Now f is continuous and a, — 0, hence 


r+Q,y, 


lim Es f(t)dt = f(z) 


N—0O An 


However, because a, € P(f) we have, for any z and y, 


= f(t)dt = er ae 


(see Exercises 2, problem 3). Thus letting n — oo, we find that f(z) = f(y) 
showing that f is a constant. 


Corollary 2. Any nonconstant, continuous, periodic function has a smallest 
positive period. 


Proof. Again we shall prove our result only for real-valued functions. So let 
f:R — R be nonconstant, continuous, and periodic. By lemma 1, the set 
P,(f) = {a € P(f) | a > 0} is nonempty. Let ao be the greatest lower bound 
for this set (Chapter Zero). If ao is zero, then a sequence in P,(f), a sequence 
of positive periods of f, must converge to it; i.e., f would have arbitrarily small 
periods. This cannot be the case, because f is not constant, and so ag must be 
positive. 

To complete the proof we need only show that ao € P;(f). We argue by 
contradiction. If ag is not in this set, then we must have {a,} C P,(f) such 
that lim,-..5 @n = Qo. But then, because f is continuous, we have f(z + ao) = 
tim (2+ an) for any fixed xr. However, f(r+a,) = f(z), so f(xr+ao) = f(z) 


for any z, and this says ag € P(f). 


Combining our results we see that if f is a nonconstant, continuous, periodic 
function, then f has a smallest positive period ap and P(f) = {nag | ne Z}. 


In all that follows we shall denote the unit circle (the circle of radius one 
centered at the origin) by T. We shall solve the DP for this circle. Note that if f 
is a continuous function on T, then we may define f(6) to be f(cos@, sin @) for 
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all 6 € R. It is clear that f is a continuous function on R and that 27 is a period 
of this function. Thus every continuous function on T gives us a continuous 
function on R that has 27 for a period. Conversely, given a continuous function 
f on R that has 27 for a period, we may define f(cos 6, sin9) to be f f(0), for 
all 6, obtaining a function f that is continuous on T. 


Definition 2. A function that has 27 for a period, whether or not it is the 
smallest period of this function, is called a 27-periodic function. 


Exercises 2 


1. 
2. 


* 3. 


* 4, 


x 6. 


Let f: R — K. Show that P(f) = R if, and only if, f is constant. 


Recall the set of rational numbers Q. Define x(x) = 1 when z € Q, 
x(x) = 0 when z € R\ Q. Show that P(x) = Q; hence, x has no smallest 
positive period. 


Let f: R — R be continuous and periodic and let a > 0 be any element 
of P(f). Show that for any real numbers z and y we have f-** f(t)dt = 
vr" F(t)dt. 


Let a, b be two fixed complex numbers and let 4 be a positive real number. 
Let g(z) = acos Az + bsin Az. Show that P(g) = {222|n € Z}. 


. Let f,g be two periodic functions. 


(a) If the set P(f)N P(g) contains a nonzero element, show that af + bg 
(here a, b are any complex constants) and f-g are periodic functions. 


(b) When f has a smallest positive period a and g has a smallest positive 
period 8, show that P(f)M P(g) contains a nonzero number if, and 
only if, a = rf for some rational number r # 0. 


Let h: R-~C 


(a) Show that there are functions f,g, both from R to R, such that 
h(x) = f(z) + ig(x) for all zx. 

(b) Show that P(h) = P(f)M P(g). Hence if h is periodic, then both f 
and g are periodic; the converse is false. 

(c) Show that h is continuous if, and only if, both f and g are continuous. 


(d) If h is continuous and has arbitrarily small periods, show that h is 
a constant. 


(e) If hk is continuous, periodic, and nonconstant, show that h has a 
smallest positive period. 


«7. If f: R — C is continuous and periodic, show that it is uniformly contin- 


uous on R (Chapter Zero, Section 5, Definition 3). 
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3. The Method of Fourier 


The DP requires that we solve a certain partial differential equation subject toa 
given boundary condition. The mathematical details are a bit complicated, and 
it is wise to begin with a simpler problem involving the solution of an ordinary 
differential equation. The relevance of this simpler problem will become clear 
very soon. 

We begin with the equation 


60) + 4O(8) = 0, a real constant (1) 


We seek nontrivial (i.e., not identically zero) solutions to (1) that are 27- 
periodic. There are three cases: 


(i) ys = 0. Here the general solution to our equation (6 (8) = 0) is easily 
seen to be 


0(8) = co9 + ao 


where co and do are constants. This is nontrivial and 27-periodic only 
when Co is zero and Qo is not. 


(ii) ye < 0, say ys = —A? where 4 is a positive real number. So (1) becomes 
@”(@) — 128(6) = 


We set 6(0) = e”®, m to be determined, and putting this into our equa- 
tion, we find that (m? — \?)e™® = 0. It follows that m = +A and the 
general solution to our equation is 


(8) = cye*? + cge7*? 


Clearly, there is no choice of the constants c, and cg that gives us a 
function that is both nontrivial and 27-periodic. The problem has no 
solutions when pz < 0 (see Exercises 3, problem 1). 


(iii) > 0, let us say p = A? for some A > 0. Again putting 6(0) = e”® 


into our equation (i.e., into @”(6) + \2@(0) = 0) we find that m = +iX. 
Hence, the general solution is 
@)(@) = ce, 67°? + co,e7 9 = ay cos AO + by sin AO 


where a), 6) are arbitrary constants, not both zero. We are seeking solu- 
tions that are 27-periodic. Now 


={n (= = |ne Z} (Exercises 2, problem 4) 


and so this condition is satisfied if, and only if, \ is a natural number. 
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To sui up, equation (1) has nontrivial 27-periodic solutions when, and 
only when, yp = n*7, n =0,1,2,--- and these are 


0,,(0) = a,, cos né + 6,, sin nd (2) 


Let us turn now to the DP for the unit circle 7. The geometry suggests that 
we use polar coordinates, and we let. G be the set of all points inside T. Thus 
we are given a real-valued function f(@) that is continuous on 7, and we seek 
a function u(r.@) that satisfies 


du 1 du, 1 Oy 
Or? r Or xr? 06? 


at all points of G, that is continuous on G UT, and is equal to f(@) on T. 

Now equation (1) has, depending on p, a general solution that can be 
represented by a simple formula. Equation (3), however, has so many particular 
solutions (Chapter Zero) that we cannot expect to represent all of them in any 
simple way. Another approach is needed, and the standard one is to restrict 
ourselves to solutions u(r,@) that can be written in the form R(r)@(@). This 
may seem like a drastic assumption and a severe limitation on what we might 
find. One might wonder if there are any solutions that can be “factored” in this 
way. And, even if there are such solutions, might there not be other perhaps 
more interesting solutions that cannot be so written? Are we to ignore these? 
Both objections disappear, however, once we show that our restriction does in 
fact lead us to a solution, because, as we have seen (theorem 2 of Section 1), 
solutions to the DP for a circle are unique. This is what justifies our entire 
approach. 


So let us set u(r, 6) = R(r)O(6) and put this into (3). We find, after some 
algebra, 


=0 (Exercises 1, problem 1) (3) 


r?R’” +rR' m 6” 


R cy 
Because r and @ are independent variables, it follows that both expressions 
are equal to the same constant, say, 4. Thus we have two ordinary differential 
equations 


rR" (r)+rR'(r) — nR(r) = 0 (4) 

and 
6"(0) + pO(6) =0 (5) 
For each fixed jz we find the general solution to (4) and to (5) and their product 


is a solution to (3). However, we want u(r,@) = R(r)O(@) to be equal to the 
given function f(@) when r = 1, i.e., on T. This says that the solutions to (5) 
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that are of interest here must be 27-periodic. Our discussion at the beginning 
of this section shows that we will find such solutions when, and only when, 
p. = n* where n = 0,1,2,--- and that these solutions are 


6,,(0) = a, cosné + by, sinnd (6) 
Equation (4) now becomes 
r?R"(r)+rR(r) —n?R(r) =0 (7) 
and its general solution, when n # 0, is easily seen to be 
R(r) = cr" + cor” (8) 


(see Exercises 3, problem 2). Because R(r)6(@) is to be harmonic inside the 
unit circle, and cgr~”" is undefined at the origin, we must take cp = 0. When 
n = 0, the general solution to (7) is also undefined at the origin and hence 
cannot be harmonic (see Exercises 3, problem 3). 

We have found a whole class of solutions to (3) that are 27-periodic when 
r= 1. They are 


r™(a, cos nO + b, sin n@), n= 0,1,2,3,--- (9) 


It would be remarkable if any one of these solutions was equal to the given 
function f(@) on T. However, because (3) is linear (meaning sums and scalar 
multiples of solutions to this equation are again solutions), the early inathe- 
maticians simply “added” them all up to obtain 


of 
u(r,9) = ag + > r” (dy cos nO + b, sinn@) (10) 


n=l 


and then asked if one could choose the constants a, and 6,, so that 


f(@) = u(1, 0) = agp + (an cos n@ + 6, sinn@) (11) 


n=l] 


This is the problem that so troubled D’Alembert, Euler, and Bernoulli. Modern 
mathematicians have learned to be wary of infinite sums. We immediately ask 
about the meaning of (11). Some form of convergence is implied, but conver- 
gence in what sense? 


Exercises 3 
1. Let A be a fixed, positive, real number. 


(a) Show that ae*? + be~*7 = 0 for all z if, and only if, a = 6 = 0. 
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(b) Set f(x) = ce + coe~** where c), cz are constants not both zero. 
Show that P(f) = {0}, i.e., if f(z +a) = f(z) for all z, then a = 0. 


«2. Use the substitution r = e' to show that equation (7) is equivalent to 
R-n?R=0 where R= < R. Next find the general solution to (7). 


«3. Solve the equation r?R”’(r) + rR'(r) = 0. 


. Ou Ou Ou Ou ; 
4. r r ¢ [— —_—_ = —_— -— —— = tt 5 ae = 
4. Solve the equations Az + ay 0 and an 3 O by setting u(z, y) 
X (x)¥ (y). 


4. Uniform Convergence 


The basic facts about uniformly convergent sequences and series are probably 
known to inost readers. A quick review of the main theorems may, however, be 
helpful. 


Definition 1. Let D C R bea nonempty set and, for each n € N, let f,,: D— 
K. We call {f,} a sequence of functions in D. We shall say that our sequence 
converges to the function g: D — K uniformly over D if, given any € > 0, there 
is a Natural number N such that 


\9(z) = fn(z)| <e€ 


for all z € D. whenever n > N. 
Our first result, though not terribly surprising, is useful. 


Lemma 1. Let DC R bea nonempty set. A sequence of functions { f,,} in D 
converges uniformly over this set if, and only if, to each e > 0 there corresponds 
a natural number N such that 


lfn(z) — fm(z)| < 


for all x in D, whenever both m and n exceed N. 


Proof. Suppose that our condition holds for the sequence { f,,}. Then for each 
zé D, {f,(z)} C K is a Cauchy sequence (Chapter Zero, Section 5, Definition 
5). Any such sequence converges to a point of K that we shall denote by g(z). 
Now given € > 0 we can find a natural number N such that |f,(r) — fm(z)| < € 
for all s € D, whenever both m and n exceed N. Letting m — oo in this 
inequality gives us |f,(x) — g(x)| < € for all z € D, whenever n > N. But 
this says {f,,} converges to g uniformly over D, and the proof is complete in 
this case. 
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Now suppose that the given sequence {f,} of functions in D converges to 
a function g uniformly over this set. Then, given ¢ > 0, we can find a natural 
number N such that |g(z) — fn(z)| < €/2 for all z in D, whenever n > N. 
But then |fn(z) — fm(z)| < |fa(x) — g(x)| + lo(z) — fm(z)| < € for all z in D, 
whenever both m and n are > N. Thus {f,} satisfies our condition. 


Given a sequence {u;} in D we may construct another sequence in this set 
by letting 


fi(z) = ui (x) 
fo(z) = uy; (xr) + uo(z) 


oeeee e © © © © © © © © © & & 


eee e® @ @ © @ © @ © @ © @# 8 ®8 @ 


When {f,} converges to a function up uniformly over D we say that the 
series 


» u;(z) 


OS 
is uniformly convergent to uo(z) over D. We sometimes write ug(z) = > u,(zx). 
j=) 


Lemma 1 has an important consequence: 
Corollary 1. (Weierstrass M-test) Let {u;} be a sequence of functions in D 
and let {4f;} be a sequence of positive numbers. Suppose that 
(i) [u;(x)| < M; for all z € D, and each fixed j; 


oO 
(ii) > M; is convergent. 
j=l 


fo a) 
Then the series ys u,;(x) converges uniformly over D. 
j=1 


Proof. Because © M; is convergent we can, given € > 0, choose a natural number 
N such that 


m n 
So M;-S > M|<e 
j=l j=l 
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whenever both m and n exceed N. Thus if m > n, 


> M; <e 


j=nt+l 


11} 
whenever n > N. Now, for each n, let f(z) = > u;(z). 
j=l 
Then, when m > n, 


lfm(z) — fa(z)| =| ¥— uj(z) - S— u,(z)| 


j=1 j=l 


=| S> uj(z)ls S> lus(z)| 


j=nt+1 j=n+l1 


by (i). Combining this with our first observation we see that 
lfn(z) — fm(z)| < € when m>n>QN. 


By Lemma | the sequence {f,}, and hence the series Du,;(x), converges uni- 
formly over D. 


There are theorems that tell us when a certain property that is shared by 
each term of a convergent sequence of functions is also shared by the limit of 
this sequence. The main applications of these results, for us, are to series of 
functions that are defined on some interval of R and converge uniformly over 
this interval. So let [a,b] C R, let u;: [a,b] + K for each j = 1,2,..., and, for 

n 


each n, let fn(z) = >— u;(z). We shall assume that {f,} converges to uo(z) 


=) 


uniformly over [a, 8}. 


Lemma 2. If zo € (a,5] and each u;(z) is continuous at this point, then uo 
is also continuous at this point. In particular, if each u; is continuous on [a, 6}, 
then ug is continuous on this interval. 


Proof. First suppose that zo € (a,b). We may write 
|uo(z) — uo(zo)| < |uo(z) — fn(z)| + |fn(Z) — fn(Z0)| + |fn(z0) — uo(zo)| 
Now suppose that ¢ > 0 is given. We first choose, and fix, f, so that 


Iuo(z) - fa(z)l < = 
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for all x in (a, 6]. For this n we use the continuity of f, at zo to choose 6 > 0 
such that 
\fn(z) ~ fn(z0)| < = 
whenever |z — Zo| < 6. It then follows that 
|uo(x) — uo(zo)| < € 
whenever |z — Zo| < 6, which shows that ug is continuous at Zo. 


The cases 29 = a and Zo = b are left to the reader. 


Lemma 3. If each of the functions u;(x) is continuous on {a, b], then 


7 uo(z)dz = [ (5 oe) dr = mH u;(x)dx 


Proof. By lemma 2 the function uo(z) is continuous on [a,b], and hence its 
Riemann integral exists. Now 


6 6 6 
| | fa(z)dz - | uio(x)do| < : Mfa(a) — ug(2)|dz 


and because {f,} converges to up uniformly over [a,b] we can, given € > 0, 
choose a natural number N so that 


\fn(2) — uo(2)| < -— 


for all x € [a,b], whenever n exceeds N. Thus 


b b n pb b 
|| fu(z)az ~ | ug(x)dz| “ID uj(z)az — | uo(x)dz| < € 


whenever n > N. This proves the lemma. 


One cannot, in general, find the derivative of uo(z) by differentiating the 
series Du,(xr) term-by-term. In fact, uo(z) may not have a derivative even when 
every u;(z) does (see below). However, we do have the following: 


Lemma 4. Suppose that each of the functions u;(z) is continuous on [a, }], 
and has a continuous derivative at every point of (a,b). Suppose further that 
the series 


> uj (z) 
j=! 
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converges to a function v(x) uniformly over (a, 6). Then uo(z) is differentiable 
on this (open) interval and 


u(x) = v(z) 
at each x € (a, 6). 


Proof. By our assumptions each of the functions 


n 
falz) = D> u;(2) 
j=1 
is continuous on [a, b] and has a continuous derivative on (a,b). Moreover, the 
sequence { f/ (x)} converges to v(z) uniformly over (a, 6). It follows, from lemma 
2, that v(z) is continuous on this interval. 


Now given x € (a,b) we can choose a number zo < z such that [zo, z] C 
(a,b). Then by lemma 3 


F u(t)dt = im, f fi (t)dt = lim {fn(z) — fn(z0)} = uo(z) — Uo(zo) 


Zu 


Thus 


ug(x) = [ v(t)dt + uo(Zo) 


G 


It follows that uo is differentiable at x and that 


Up = v(z) 


A uniformly convergent series of even such well-behaved functions as sines 
and cosines can converge to a function with very strange properties. The fol- 
lowing examples illustrate this point. 

It is said that in a lecture given in 1861, Riemann asserted that the function 


“ sinn?arr 
me) = 5 S00 
n=l 


is continuous at every z but does not have a derivative at any point. The M- 
test shows that the series converges uniformly over R and so the limit function 
is certainly continuous by lemma 2. When Karl Weierstrass heard about Rie- 
mann’s claim, he tried to prove that R(x) was nowhere differentiable. He was 
unable to do so but, in the course of trying he came up with the following set 
of functions, 


© 
Wir(r) = >; a" cos b° xz 


n=1 
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and was able to show that when 0 < a < 1, b is an odd integer and ab > 1+ Sr, 
these functions are nowhere differentiable. Again the M-test shows that thie 
series converges uniformly over R when 0 < a < 1; hence, the associated func- 
tions are continuous. In 1916, G. H. Hardy improved on this result by showing 
that the functions W,.,(x) are nowhere differentiable when 0 < a < 1, b> 1, 
and ab > 1. Hardy also showed that R(z) has no derivative when zx is an irra- 
tional number or a rational number of a certain kind. Surprisingly, J. Gerver 
proved that R(z) does have a derivative at certain other rational points. This 
he did in 1969 while he was an undergraduate student at Columbia University. 


Exercises 4 


For problems | and 2, T is the unit circle and G is the set of all points that are 
inside T. 
+1. Suppose that we have a sequence of functions each of which is continuous 


on G UT and harmonic in G. If this sequence converges uniformly over 
T, show that it converges uniformly over GU T. 


+2. (a) For any fixed n show that r” (a, cosné+ 6, sinn@) is harmonic 
in G. 
(b) For p fixed, 0 < p < 1 show that >> p", > np", and 5° n?p" are 
convergent. 


(c) Suppose that, for some M,|a,| < M and |b,| < M all n. Set 
u(r,0) = yr (a,, cos n@ + 6, sinn@) for all (r,8) € G. Show that 


n=1 
u(r,6) is harmonic in G. 


OO 

3. Recall that, for a given sequence ay, a),... of real numbers, ye a,x" is 
n=0 

called a(real) power series. To any series there corresponds an R, 0 < 


R < oo. such that > a,x" converges for all x with |z| < R and diverges 
for all z with |z| > R. R is calted the radius of the series. 


(a) For each fixed p, p < R, shows that the series converges uniformly 
on [—p. 9}. 


(b) For each z € (—R, R), define f(z) to be > a,x", and show that f 
n=0 
is continuous on (—R, R). 


(c) If [a,b] < (—R, R) show that 


[seve = [ (3: on") dr = Yio [ve 


n=0 
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l—z 


oS 
(d) Show that » zx” converges to ;— at each point of (—1, 1), but that 
n=0 
convergence is not uniform over this interval. 


oS 
4. Suppose that > a,x" has radius R, R # 0 and suppose that the series 


n=0 
oo) 


) na,x"~' has radius R’. 


n=] 


(a) If |x| < R. choose zo so that |2| < |zo| < R and show that for some 
constant, call it A, we have |a,2z5| < A for all n. 


(b) With x and zo as in (a) write 


and conclude that 


A 
\na,z” | < —nr" 


lzo| 


where A is the number found in (a) and r = |=|. 


oo 
(c) Show that 2 A apt is convergent and conclude from this that 
n=1 


— |Zo| 
> na,xr" ! converges whenever |:r| < R. Thus R < R’. 
(d) Now suppose that R < |z| < R’ and note that 


x 2 
lanzr"| = jnanz” "||| < |nanz” J 


when |z| <n. Deduce a contradiction and conclude that R’ = R. 


co 
(e) Show that 2a Qnx” is differentiable at every xz € (—R, R) and that 


n=0 


oS 
its derivative is Dy nenr”?. 


n=) 
co oo 
5. Suppose that we have two power series, > Qnxr" and >~ b,x”. Suppose 
n=0 n=0 


that for some r > 0 we have DS anzr” = S| b,x" whenever |z| < r. Show 
that a, = b, for n =0,1,2,.... 
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5. The Formulas of Euler 


The boundary data for the DP is a real-valued function that is continuous on 
the unit circle, on T. Let us denote the set of all such functions by C,(T). So 
every element of C,(T) is a real-valued function that is continuous on T or, 
equivalently, a real-valued function that is defined on R is continuous and is 
2x-periodic. Our discussion thus far has led us to ask if every f € C,(T) can 
be “represented” by a trigonometric series. In 1777 Euler made the first real 
progress toward answering this question. We shall present his results here. 
Let us suppose that the trigonometric series 


1 — 
50 + > (an cos né + b,, sin n@) (1) 


n=) 


converges to the function f(@) uniformly over R (or, equivalently, over T). 
We are supposing also that the coefficients in this series are real, so clearly 
f € C,(T). Now given € > 0, we can find a natural number N such that 


|f(8) - G ao + (an cos nO + b,, sinnd)}| < (2) 
n=1 
for all 6, whenever m > N. We may write, for each n, 
ind —~in8d in@ _ ,-in® 
cos n@ = iis Saas sin @ = : = 
2 2% 


Putting these expressions into (2) we obtain 


1F(8) - (50+ Do (5 sa) aad? > Ga nee eM ce 


or, setting co = $a0, Cn = 2958, c_, = Sativa for each n = 1,2,.... 


If(0)-— So cnet] <e (4) 


n=—m 


for all 8, when m > N. 


Definition 1. When DC R, D # 9, and we have a set of functions u; : 
co 


D ~— K, one for each 7 € Z, we say that the series > u,(x) converges to the 
j=-0o 

function v(z) uniformly over D whenever the sequence fo(x) = uo(z), fi(z) = 

u_i(x) + up(z) + w(z),...,fx(z) = L5__, uj(z),-.. converges to v(z) 
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uniformly over D. More precisely, we say that the series converges to v(z) uni- 
formly over D if, given ¢ > 0, we can find N such that 


u(x) - S> u(x) <e 


jan 


for allz € D, when n > N. 


The inequalities (2) and (4) show that the series (1) converges to f(6) 
uniformly over R if, and only if, the series 


ind é 
>> cne'”? (5) 


converges to this function uniformly over R. 


We began by assuming that (1) converged to f(@) uniformly over R, and 
we have just seen that this is equivalent to assuming that (5) converges to f(6) 
uniformly over R. The results we want can be obtained, very easily, from the 
latter series. 

Choose, and fix, an integer p. Then the series 


oo oo 
(3: cent) 8 =) cqeto-7 
—0o 


—0oo 


converges to f(#@)e—*?* uniformly over R; equivalently, uniformly over T'. Thus 
by lemma 4.3 we have 


f(0)e—*?? do = / (3: eit do 
—* —-wW —0o 
oo Ww 
= Sen | eiln—p)9 49 
—6Oo —* 


It is easy to evaluate these integrals. Clearly, 
/ " e'(n—P)6 dg — 0 


when n # p, and when n = p, the integral is equal to 27. 
Thus our last infinite series reduces to a single term: 27c,. And this gives 


pe Po / f(o)e*da Ss allpeZ (6) 
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Thus the coefficients in the series (5) can be computed from f (8). 


The coefficients in the series (1) can also be computed from this func- 
tion. We have ag = 2co, and a little algebra gives a, = Cy +C-n, On = 
Ue, 6.4) 9 10H Ny oes dss Hence (6) gives us 


Qn = : f(@) cos nd6, n= 0.1.2..%. (7) 
1 f” : 
b, = -{ f(@) sin n6d6, M1 2s. (8) 


and here we see why it is customary to put the ° before the apo. Without this 
factor the formula (7) would not be valid when n = 0. 


Given a function f € C,(T), the numbers obtained from f using formulas 
(7) and (8) are called the (real) Fourier coefficients of f. We sometimes denote 
them by a,(f), 6,(f). The trigonoinetric series (1) whose coefficients are the 
Fourier coefficients of f is called the (real) Fourier series of f. 

The numbers obtained from f by using (6) are called the complex Fourier 
coefficients of f and are sometimes written c,(f). The series (5) is called the 
complex Fourier series of f when its coefficients are the numbers c,(f). 


We may now ask whether the Fourier series of a given function converges to 
that function. This is a tricky matter. The discussion of these series is intricate 
and technically challenging, and it is not even possible to state many of the 
results unless we first discuss the Lebesgue integral. One of the major reasons 
for interest in that integral is the profound impact it had on the study of Fourier 
series. In the next few sections we discuss the convergence of the Fourier series of 
a function in C,(T) and use the results we obtain to solve the DP. A discussion 
of these series for more general functions is given in the next chapter. 

We end this section with an important theorem due to Riemann. 


Theorem 1. For any f € Ct) si Cn(f) = 0. 
n}—- oO 


Proof. Given f € C,.(T) we may write 
~en(f) = een(S) = so f : f(t)enn!- Dat 


In this integral we set u = ¢ — = to obtain 


-en(f) = 5 | 


pf 
x n 


w 
us m 


Ws -3 1 f"* 7 

~\e~inuy, — —)e7 itu 
flute du sz [fut Ze u 
where we have used problem 3 of Exercises 2. Now 


2en(f) = en(f) — [-en(F)] 
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and so we have 
1 . us —int 
den(f) = 5 f {f0t)- st + perineal 
From this we find that 
en 5 f_ UFC) - Fle+ Z)le*tlde < maz ist) - (e+ 2) 


Now f € C(T), and so this function is uniformly continuous on R (Exercises 2, 
problem 7). Thus, given « > 0 we can choose a natural number N such that 


T 
f(t) - f (t+ =) |< 2¢ 
whenever |nj > N, and this holds for any t € [—7, 7]. It follows that 
len(f)l Se 


whenever jn| > N, and this proves our theorem. 
Corollary 1. For any function f € C,(T) we have lim an(f) = 0 and 
jim | bn(f) = 0. 


Proof. Because an(f) = en(f) + c-n(f) and bn(f) = i{en(f) — c-n(f)} the 
theorem shows that Jim an(f)=0= tim bn(f). 


Exercises 5 


1. Define f(z) = x? for —7 < x < 7, and f(z+2m) = f(z) for all x. Clearly, 
fec,(T). 


(a) Compute the real and the complex Fourier coefficients of f and show 
that the real and complex Fourier series of this function converge 
uniformly over R. 


(b) Assuming that the real Fourier series of f converges to this function, 
ba 1 = (—1)"+! 
compute the numbers (i) dX 3 and (ii) > a 
n= n= 


*2. For f € C,(T) show that the Fourier coefficients of f are bounded; i.e., 
show that there is a real number p’ such that |a,(f)| < us’ and |bn(f)| < 
for all n. 


3. If f € C,(T) has Fourier coefficients {c,} and if f’ exists and is also in 
C,(T), show that | lim nc, = 0. If f” also exists and is in C,.(T), show 
n|—>oo 


that lim nc, =0. 
|n|—-00 
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6. Cesaro Convergence 


A modified form of convergence was introduced into mathematics by the Ital- 
ian mathematician Ernesto Cesaro. This notion seems, at first glance at least, 
somewhat contrived and, perhaps, artificial. A major result of Leopold Fejér 
(next section) shows, however, that this type of convergence is the natural one 
to use in connection with the Fourier series of a function in C,(T). 


Definition 1. Given a sequence ao, @), a2, ... in K the numbers o, = 
ap, 02 = 3%, og = Satsite2, _.. are called the Cesaro means of this 
sequence. We ‘shall say that {an} is Cesaro convergent to the number g if 
lity sco Cn = SC: 


One can easily show that a convergent sequence is Cesaro convergent to 
its (ordinary) limit. There are, however, sequences that do not converge in the 
ordinary sense that are convergent in this new sense (see Exercises 6, problems 
1 and 2). So Cesaro convergence “contains” and extends the ordinary notion of 
convergence. 

As one might expect, a series is said to be Cesaro convergent (or, sometimes, 
Cesaro summable) to the number s’ if its sequence of partial sums is Cesaro 
convergent to s . This can get a bit messy. The mth Cesaro mean, call it om, 


for the series De ay is 

—- OO 

ye ay + ae a, + Die -2 Gk 9 Sada eee ak 

on = SE Oi) 
mm 
We shall now derive an expression for the mth Ceséro mean of the complex 
Fourier series of a function f € C,,(T). It turns out that there is a nice, compact, 
expression for this mean in terms of an integral. 
Given f € C,(T) the nth partial sum of its complex Fourier series is 


sn(aif)= So a(fle* = So et E [ fluer**¥ay| (2) 


k=-n k=-n 


-/ “500 | elke ” dy 


k=—-n 
The mth Cesaro mean of this series is 
So(Z; Jf) + 8)(25f) t-°° + 8m-1(73 
aes; f) = 20EA) Fla) (af) 


— (3) 
Combining this with (2) gives us 


= dy (4) 


x eg CREW 4 te Ey ethlenw) 
Om(z;f) = | py) ae tt hin EY] 
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Observe that the integrand contains the mth Cesaro mean of the series 
oo 
\, etk(z-y) (5) 
k=-0900 


If we denote this mean by K,,(z — y), then (4) becomes 


Om(Z; f) a | f(y) Km(z _ y)dy, m= 1, 2,3: me (6) 


We shall use this expression in the next section. 


The functions K,,(z), m = 1,2,..., are the Cesaro means of the series 
> ~.. e'*?. These functions have some useful properties: 


Lemma 1. For each fixed m we have 


0) Kins) = (HR) =o (Sl) 


(ii) Ay(z) > 0 for all z 


ee. la 
(iii) on |. K,,(z)dz = 1 


Furthermore, if J is any open interval containing zero, then 


lim, sup{ Km(z) |2 € (-2,2]\ J} =0 


" 
Proof. Let us set s,(z) = B e'*= so that we may write 


-—n 


Sn(Z) + $n_1(Z) +--+ + $1(Z) + 80(Z) 


Kn+i(z) = nel 
and 

Kise 8n-1(T) + 8n-2(7) +++ + 81(Z) + 80(z) 
then 


(n + Ll) Kygi(z) —nKy(zr) = 8n(z) = Seite = Seite + Sette (1) 
-n 0 ] 


These last two series are (finite) geometric series and can be summed to give 
12 ei(n+])z 


1 — e'* 
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and 
l— ea tM(ntl)z 
l-e- 
respectively. Adding these expressions and doing a little algebra gives us 


: cosnz — cos(n + 1)zr 
(n+ 1)Kn4i(z) —nK,(z) = cone eee (2) 
Now K,(x) = 1 for all x, and putting this into (2) we find K»2(r). Putting 
Ko2(z) into (2) gives us K3(z), and so on. In this way we find that 


1 {1-—cosmz 
K(x) = — — 
— COSE 
which is the first expression in (i). Using our knowledge of the trigonometric 
functions we can transforin this into 


2 
l (= an) 
oe a | 
m \ sin dz 


which is the second expression in (i). Note that this proves (ii) as well. 


To prove (iii) we consider the function that is 1 for all z. The Fourier 
coefficients of this function are 
l us 


ee le~*** dr 
on |. e x 


Ck 
for k € Z. Clearly, c, = 0 when k # 0, and co = 1. Thus the Fourier series of 
this function consists of a single term, which is 1. Now when we coipute the 
mth Cesaro inean of this series, the 1 occurs m times in the numerator. But 
we also divide by m. Hence 


Om(2;f =1)=1 


for all z and each m = 1,2,.... We now refer to equation (6) above. For the 
function we are treating here this equation becomes 
1 6 
l= — 1-K —y)d 
on ae m(z y) y 


This last expression is true for any z, so we may take z = 0. Finally, because 
Km(—y) = Km(y) by (i), we have proved (iii). 

Now suppose that 0 < 6 < m and 6 < |x| < m. Then 32 and 36 are both in 
the interval where the sine function is monotonically increasing. Hence 


2 2 
1 ] 
in — < [sin —: 
(si 35) < (sin 5) 
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Given an open interval J that contains zero, we may choose 6 > 0 so that 
(—6.6) C J. Then 


sup{K,(z) | z € (—7%, 7} \ J} < sup{Kn(z) | 6 < [2] < 7} 
But this last term is less than or equal to 


e 2 
1 (==) l 
sup — { — ES 
1 YY l 2 
n \ sin 52 n| sin 56| 


Letting ™ — oc in this last expression proves our claim. 


Exercises 6 


1. Show that the sequence {(—1)"*+!}%,, which is clearly divergent, is 
Cesaro convergent zero. 


#2. If the sequence {a,,}°, is convergent to b, show that it is also Cesaro 
convergent to 0. 


+3. We call {km}, Fejér’s kernal. Show that K,,(z) = > (1— Bl ete, 


|jxl|<m 


oo 
4. Suppose that the series 2 a, is Cesaro convergent to b. Suppose further 


n=) 
that the sequence {na,}%, is bounded. Show that the original series 
niust converge to 6. Results of this kind (i.e., results that give conditions 
under which a series that converges in some general sense must converge 
in the ordinary sense) are called Tauberian theorems. 


5. Let {a,} and {b,} be two sequences of real numbers. Suppose that 
(a) i. ay > any, for all n; 
ii. lim a, = 0; 
n—-S 
ili. The series > b,, has bounded partial sums. 
(b) Show that >= anbn is convergent. Hint: Let so = bo, 3s; = bo + 


n 
ee >_ 8. Write agbo + a,b; +--+ aS Aosq + a;(8) — 39) + 
j=0 
a2(s2 — 81) +-°-. 
(c) Show that > anbn is convergent even if the b,,’s are complex, just 


as long as the partial sums of y~ b, are bounded. 
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OO oy, mn 

i z 

d) Sh that ) > — i t. M ally, sh that ) — 

(d) Show tha 27 is convergent. More generally, show tha 247 
converges when |z| = 1 and z £# 1. 


(e) If we take 6, = (—1)", show that this test, which is due to the 
Norwegian mathematician Niels Abel, gives us the familiar Leibnitz 
test for alternating series. 


(. Fejér's Theorem 


Scientists who are engaged in SETI (the search for extraterrestrial intelligence) 
call it the Fermi paradox. It first arose during a lunchtime conversation among 
the physicists Enrico Fermi, Emil Konopinski, Edward Teller, and Herbert 
York, which took place at Los Alamos sometime in the year 1950. They were 
speculating about when we could expect to achieve faster-than-light travel (as- 
suming this is possible). Each man gave his opinion, and the talk turned for 
awhile to scientific subjects of more immediate interest, when Fermi suddenly 
asked “So where is everybody?” It was an abrupt change of subject, but every- 
one knew what he meant. Given the age of the universe and the widely held 
belief that technically sophisticated civilizations are fairly common, even the 
most pessimistic estimate of when faster-than-light travel is achieved implied 
that visits from “elsewhere” should be. not frequent perhaps, but not really 
uncommon either. Yet we have had none! This question has been discussed by 
scientists ever since, but with no general agreement on an answer. It is said 
that when the Hungarian physicist Leo Szilard heard of it he quipped, “They 
are among us, but they call themselves Hungarians.” 

Szilard was famous for his off-beat sense of humor and, no doubt, would 
claim that his facetious remark also explained why the tiny country of Hungary 
has produced so many outstanding mathematicians. We shall discuss tle work 
of one of them, Leopold Fejér, here. We shall then use his result to complete 
our solution to the DP. 


Theorem 1. The Cesaro means of the Fourier series of any real-valued func- 
tion that is continuous on the unit circle converge to that function uniformly 
over the circle. 


Proof. Let f € C,(T) and let o,(z) denote the nth Cesaro mean of its (com- 
plex) Fourier series. Given € > 0, we shall show that there is a natural number 
N such that 


lon(z) — f(z)| <eé 
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for all x € T, whenever n > N. We shall use formula (6) and Lemma 1 of 
Section 6. From (iii) of that lemma we may write, for any fixed z, 


on(2) — f(x) = xf. Fl) Kula — that — » ~ f Kna(t)f(x)dt 


In the first of these integrals we set u = x — t and use the 27-periodicity of the 
integrand to write 


:  $(t)Ka(x —t)dt = - / _ f(z —u)Kp,(u)du 


—— f(z — u)K,(u)du = : f(z —u)K,(u)du 


us 


We may now write 
—_ 1 7 
on(z)~ f(z) = 5 f {fle #)- f(a) }Kalt)dt 
From this last expression we obtain 
lon(z) — f(z) < ie — t) — f(z)|Kn(t)dt (1) 
= tf e-9- sekaar+ 3 7 J gg f@~ 8 ~ LeNKn(ta 


where 6 > 0 is to be determined. 
The first of these integrals can be estimated as follows: 


4 
sz | We- 8 - HelKa(ta (2) 
1 r.) 
< sup If e- 8) Saiz : Ka(tyde 


< sup [f(e-t)~fla)ig- [| Ka(thdt= sup _If(e—#) - F(a) 
—6<t<d us —5<t<d 


—® 


where our change in the limits of integration is justified by (ii) of Lemma 1 of 
Section 6 and we have also used (iii) of the lemma. 
Let us now estimate the second integral appearing in (1). We have 


] 
Om Jeg f(E~ *) ~ Le) Katha (3) 
< [max ste)l] (sup Ka(e)}2n 
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Now suppose e€ > 0 is given. We use tlie uniform continuity of f to find 
6 > 0 such that 


f(z -t) - f(z) <5 


whenever t € [—6, 6] and x € [—7z, 7]. With 6 now fixed we see by (2) that the 
first integral on the right-hand side of (1) is less than $. 

To finish the proof, we use the last statement in Lemma 1 of Section 6. It 
tells us that we can choose a natural number N so that 


E 
sup K,(t) << ————_—_ 
oa 4 maz |f(t)| 


whenever n > N. Combining this with (3) we see that the second integral on 
the right-hand side of (1) is less than 5 whenever n > N. 


Definition 1. A sequence {f,,} of functions defined on a set D is said to 
converge pointwise on D if for each x € D the sequence of numbers { f,(z)} is 
convergent. A series Du;(xz) of functions defined in D is pointwise convergent 
in D if its sequence of partial sums converges pointwise on D. 


If f € C,(T) and if the Fourier series of f converges at 0) € T, then its 
limit must be f(@9). This follows from Fejér’s theorem. The partial sums of the 
Fourier series, when 9 = 09, converge and so their Ceséro means must converge 
to the (ordinary) limit. But we know that these means converge to f(@9), and 
this proves our claim. We have shown the following: 


Corollary 1. If f € C,(T) and if the Fourier series of f converges at any 
point 9 € T, then this series converges to f (4). 


Remark 1. We have seen that the Fourier series for the function f(z) = 
z*, -~7<a2a<n, f(x+2z) = f(z) all z, converges uniformly over T. Because 
uniform convergence over T clearly implies pointwise convergence on T, we now 
know that the limit of this series is f(z). This establishes the results found in 
Exercises 5, problem 1. 


Let n,, n2,..., n¢ be a finite set of integers, some of which may be negative, 
and let c), c2,...,c¢ be complex numbers. The function 


n 
7(0) = > cei"? 


j=l 
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is called a trigonometric polynomial. It is clear that 7(0) is in C(T). Moreover, 
the Cesaro means of any f € C,(T) are trigonometric polynomials. Thus: 


Corollary 2. Given any f € C,(T) and any € > 0 there is a trigonometric 
polynomial 7,(8) such that 


|f(0) — 7e(0)| <€ 
for all 0 € T. 


Corollary 3. (Weierstrass Approximation Theorem) A continuous real-valued 
function defined on a closed bounded interval can be approximated, as closely 
as we wish, on that interval by a polynomial. 


Proof. Let a, b be real numbers with a < 6, and let f be a real-valued function 
that is defined and continuous on [a,b]. Given € > 0 we shall construct a 
polynomial P,(x) such that | f(r) — P.(z)| < € for all z € [a, b}. 

Let us suppose, first, that [a,b] C (—z, 7). Then we may extend f, in any 
convenient way, to a function f that is continuous on [—7,7] and satisfies 
f(-7) = f(m). Then, by Corollary 2, we can find a trigonometric polyno- 

P € 
mial r(x) such that |f(zr) — r(xr)| < 5 for all x € [—7,7]. Now 7(z) can 
€ 
be written as d= cjei™®, where the c,’s are complex numbers, the nj’s are 
j=1 
integers, and ¢ is a natural number. For each fixed 7 we may expand the func- 
tion e'")* in a MacLaurin series that converges to this function uniformly over 
[—7, 7]. Hence for each j = 1,2,... ,@ we can choose a natural number k; such 


injz 
that a. jeje"? — a J By . Thus the function c;e'"7 can be 


ky. 
approximated, to within 3, by a polynomial P;(z) = go) a over 


(—7, 7]. It follows that 


_max f(z) - SAle)l < 


j=1 


_max  |f(z) rel max gree Alas 


i 
E tn cE E = 
; o> maz|cje""?* — P;(z)|S 5 + ee 


7. Fejér's Theorem 51 


Now any finite sum of polynoinials is a polynomial, so we may set P,(.r) = 
é 
> P;(x), and the proof is complete in this case. 
j=l 


If [a, b] is not contained in (—7, 7) we first define y(2) = (b—a)r+a and note 
that this function is continuous and maps (0, 1] onto (a, 6]. The inverse (Chapter 
Zero, Exercises 2, problem 2b) of y, y~'(x) = #5, is also continuous and, of 
course, maps {a, b] onto (0, 1). 

Now f og is a continuous function on [0,1] and so by the first part of the 
proof we can, given € > 0, find a polynomial P(z) such that max, I(foy- 


= ~ly,) — it Bae 
P)(z)| < €. It follows then that max, I(f Poy ")z| max, I(f oy —P)o 


1 


yg *(z)| < max, l(foy— P)(x)| < ¢. Because Po y~" is clearly a polynomial, 


we are done. 


Remark 2. It is customary to define harmonic functions to be real-valued. 
and consequently the boundary data for the DP must also be real-valued. This 
is why we restricted our attention to functions in C,(T). It is clear, however, 
that if f is a continuous complex-valued function on T, we may use formula 
(6) of Section 5 to define numbers c,(f) and formulas (7) and (8) of that 
section to define numbers a,,(f) and b,,(f). These numbers are called the Fourier 
coefficients of f. The series }> c,(f)e'”® is called the complex form of the Fourier 
series for f, and the series $ao(f) + )>(an(f) cos nO + bn(f) sin n@) is called the 
real form of the Fourier series for this function. This terminology, althongh a 
little misleading, should not cause any problems. Observe that from Exercises 
2, problem 6c, we must. have | ie Cn(f) = 0, im an(f) = tim b,(f) = 0. 


Furtherinore, for every « > 0, there is a trigonometric polynomial 7,(8) such 


that |f(@) — 7.(0)| < € for all 0 € T. 


Exercises / 


x1. Given any f € C,(T), show that there is a sequence of trigonoimctric 
polynomials 7,, that converges to f uniformly over T. 


*2. (a) Show that the complex Fourier series of any trigonometric polyno- 
mial 7(6@) is 7(@). 


r 
(b) If r(y) = > cje'™)”, find the real Fourier series of 7(\). 


j=l 
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l-re'~ 


Show that -—1 + 2Re | , is equal to mee (Chapter Zero, 
Exercises 4, problem 6). 


. Let f(x) = |z| for —7 < xz < 7 and let f(z + 2m) = f(z) for all z. Then 


f€C,(T). 


(a) Compute the real Fourier series of f(z). 


(b) Discuss the convergence of this Fourier series and find its limit. 


. Suppose that f € C,(T) and that f’ and f” both exist and are in C,(T). 


Show that the Fourier series for f converges to this function uniformly 
over 7’. 


If f € C,(T) and if every Fourier coefficient of f is zero, show that f is 
identically zero on T. 


. If f is a real-valued function that is defined and continuous on [a, 6], show 


that there is a sequence {P,,(z)} of polynomials that converges to f(z) 
uniformly over this interval. 


. Let f be a real-valued function that is defined and continuous on (0, 1]. 
1 


The moinents of f are the numbers f(z)z"dz, where n = 0,1,2,.... 


i) 
If two functions have the same moments, show that they are identical. 


. If f is a continuous real-valued function on [0,1], we set B,(f,z) = 


a (7) f (=) z*(1—2)"—* for each n = 1, 2, .... These are the Bern- 
k=0 

stein polynomials of f. It can be shown that they converge to f uniformly 
over [0. 1]. 


At Last the Solution 


Let us now solve the DP for the unit circle T with boundary data f. The real 
Fourier coefficients of f, {a,}, {b,}, are bounded (Exercises 5, problem 2) and 
so the function 


oo 
u(r, @) = =a0 + 2 r” (a, cosn@ + b, sin n@) (1) 


n=l 
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is harmonic in G (recall that G is the interior of T). This was shown in Exercise 
4, problem 2. Using formulas (7) and (8) of Section 5 we may write 


u(r, 0) ’ + =" cos n@ + b, sin n@) = 


oe a, (y)dy + — - Sor ” f(~) cos nd | cos nO 
0 


i eae 


2n 
+( f f(y) sin upd) sino} 
l 2n 1 co 2n ‘ 
= 59 Merde+ 2¥o | Ser" casn(e — Bdp 


The series (1) converges uniformly over any closed subdisk of G (i.e., when 
r <1ro where ro < 1 is fixed), and so by Lemma 3 of Section 4 this last sum 
can be written as follows: 


l 2n oo : 
u(r,6) = 5 [ Hie) {425% conto - 0) dp 


In the series inside the integral we set y — 0 = w and note that 


oo [oe] 
1+2) > r®cosnw =-1+2) > r"cosnw = 


n=1 n=0 


oo 
—1+2Re Soren = 1+ 2Re |; : =| 
=! —re 


2 


_ l-r 
1 +r2 - 2rcosw 


where Re denotes the real part (Chapter Zero, Exercises 4, problem 6) of the 
expression in brackets and we have used problem 3 of Exercises 7. Thus 


fa l—r? 
u(r, @) = = | SW) rel (2) 


and we have shown that when f € C,(T), the function defined by this integral 
is harmonic in G. 

We now want to do two things. First, we want to show that the series (1) 
solves the DP; i.e., we want to show that u(1,9) = f(@) on T. Second, we want 
to show that (2) salves the DP in the sense that 


l-—r? 


1 2n 
(0) = u(1,6) = lim > |, AW) arco)” 
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This gives us a nice, compact, representation of the solution: We shall do this 
in two steps. 

Suppose, temporarily, that the Fourier coefficients of f are such that the 
series 


1 fe @ 
5 lao + > (lan! + |b, |) (3) 
n=] 


is convergent. Then the Fourier series for f converges uniformly over T, and so 
its limit must be f(0) (Section 7, Corollary 1). Thus the series (1) converges 
uniformly over G UT to a function u(r,@) that is continuous on this set and, 
moreover, u(1,6) = f(6@). We have already seen that u(r,@) is harmonic in G, 
and so under the assumption (3), the function u(r, @) given by (1) is the solution 
to the DP. In particular, we must have 


lim_ u(r, 6) = f(8) 


and because in G, u(r.@) is given by (2), we must also have 


1—r? 


. l 2x = 
lim — | f(¢) 1+r? — 2rcos(p — 0) 


oe | dy = f(8) 


So we have proved the two things we wanted to prove, but only for functions 
f € CT(T) that satisfy (3). 

Now suppose that our boundary data are merely continuous on T. We may 
choose a sequence of trigonometric polynomials {7,,(6)} that converges to f(@) 
uniformly over T (Section 7, Corollary 2). Now each 7,,(6) certainly satisfies 
(3), by problem 2 of Exercise 7, and so for each n the function 


pent = 1-r? 7 

un(r, 0) = xf Tal) 7 — reos(y — 8) ¥ 

is harmonic in G, continuous on G U7’, and equal to 7,(@) on T. It follows, 
because lin 7,,(0) = f(@) uniformly over T (Exercises 4, problem 1) that the 
sequence {u,(r,@)} converges uniformly over G UT to a function u(r, 0) that 
is continuous on this set and satisfies u(1,0) = f(@) on T. But clearly, in G, 


un) =2 [to ap (4) 
pte Det Je YT +r? — 2rcos(p — 8) 

because {7,,} converges to f(6) uniformly over (0, 27]. Now u is continuous on 
GUT, u=f on T, and u is given by (4) in G, hence 


l-r? 
19? orcsle =) = f(0) for each 0 € T. 


fa - 27 


2” 
we] Fe) 
0 
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Exercises 8 


1. Equation (4) is called Poisson’s integral. The family of functions P,(@) = 
rotting: 0 <r < 1is called Poisson’s kernel. One should compare (4) 
with equation (6) of Section 6. See also Lemma 1 of that section. Show 
that 


(a) P,(@) > 0 and P, is continuous on T for all r € (0, 1). 

(b) + [°" P,(@)d8 = 1 forO<r <1. 

(c) If0 <6 < 7, then lim sup |P,(@)| = 0 
r=! (9\>6 


Hint: If 6 < || < T, then P,(8) < ee 
2. Verify that P,(6) is harmonic in CG. 


3. The formula (4) enables us to write down the solution to the DP for 
a circle of radius R centered at the origin. If f is continuous on the 
circumference of this circle, then setting r = 7 and s = Ry gives 


L peee R? - r? 
une anR Jo F(s) R2 + r2 — 2Rrcos(y — 6) = 


Notes 


1. The work of D’Alembert, Euler, and Bernoulli on the vibrating string 
problem is discussed in Trigonometric Series—-A Survey, by R. L. Jeffery 
(University of Toronto Press, Toronto, 1965). 


2. A discussion of Riemann’s function, and the mysteries surrounding its 
origin, can be found in P. L. Butzer and E. L. Stark, “Riemann’s example 
of a continuous nondifferentiable function in the light of two letters (1865) 
of Christoffel to Prym,” Bull. Soc. Math.Belg. 38 (1986), pp. 45-73. 


J. Gerver published two papers about this function: 


“The differentiability of the Riemann function at certain rational multi- 
ples of 7,” Amer. J. Math. 92 (1970), pp. 33-55. 


“More on the differentiability of the Riemann function,” Amer. J. Math. 
93 (1970), pp. 33-41. 


3. The Fermi paradox is the subject of a book, Where Is Everybody?, by 
Stephen Webb (Copernicus Books in association with Praxis Publishing, 
Ltd., 2002). See especially pages 17-18 and 28-29. 


Chapter Two 


Extensions of the 
Classical Theory 


Fourier series are used to solve many problems arising in engineering and in the 
physical sciences. The functions encountered here are generally real-valued and 
defined only on some interval of finite length. They are not periodic and often 
have several points of discontinuity. To discuss the mathematics behind these 
applications, we must reformulate the definitions given in the last chapter to, 
somehow, include more general functions. 

It was the French physicist Fourier who stated, in 1807, that an arbitrary 
function could be represented by a trigonometric series provided the coefficients 
in that series were computed from the Euler formulas. He offered no proof of 
this, but he used these series extensively to solve physical problems and his 
work brought harmonic analysis to the attention of his colleagues. This is why 
these series now bear his name. 


1. Functions on (—7z, 7) 


The Euler formulas imply that any function we use them on must be integrable. 
We are supposing that the reader is familiar with the Riemann integral and 
most, perhaps all, functions arising in applications are integrable in this sense. 
So throughout this chapter “function” means “Riemann integrable function.” 
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Definition 1. Let the function f be defined on (—z,7). Then the Fourier 
coefficients of f are 


Rn 


tn = = f(z) cosnzdz , n=0,1,2,... (1) 


—® 


R 
by = = f(z) sinnz dz , n=1,2,3,... (2) 


—f 


The (real) Fourier series of f is 


x 
a9 +. (an cos nz + b,, sinnz) (3) 


n=1 


It is customary to write 


f(z) ~ a + Ss cos nz + b, sinnz) 


n=1 


to indicate that the series is the Fourier series of the given function. i.e., that 
the a,,’s and 6,,'s were computed from f(x) using the Euler formulas. 


We can define a new function, f(z). by setting 


f(r) = 520 + 5 tes cosnz + b, sin nx) 


n=) 


at every x at which the series is convergent. The hope. of course, is that f(z) 
equals f(z) for each z € (--7,7). Now if the series does converge, pointwise. 
on (—7, 7), then because sine and cosine are 27-periodic, the series converges 
everywhere except, perhaps, at odd multiples of 7. Thus f is defined at all points 
of R except, perhaps, at the points mentioned. Moreover, f is 27-periodic, 
meaning f(z + 27) = f(z) alla dg {nm[neZn odd}. Thus, unless f is 2x 
periodic, the best we can hope for is that f(x) = f(x) in (—7,7) or [-7, 7], 
and the following examples show this clearly. 


Ezample 1 


Consider the function f(x) = x? on (—7. 7). We have seen that 


f(t) ~ e445 e ae 


n=1 
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and the series converges, uniformly in fact, for all z (Chapter One, Exercises 
5, problem 1). Thus 


: x? —-] 
f(z) = tt ( 
is a 27-periodic function on R, a function in C(T). 

It is tempting to say that the series converges to z”, but this is true only on 
[—7, 7] (as we saw in Chapter One, Section 7, Remark 1, this particular series 
converges to z? at both 7 and —7). If we set x = 27, the series converges to 
f(2x) = f(0+27) = f(0) = 0 and not to f(27) = 42”. The graph of f(z) = z? 
is well-known; the graph of f is shown below: 


—3r —T T 37 a 


Ezample 2 


Let f(x) =z for all z € (—7,7). Then 


epee 


f(z) aor sin nz 


as is easily seen. We shall see that this series actually converges at each point 
of (—7,7) and that its limit, f(z), is equal to f(z) on this interval. But it is 
clear that the series converges at every odd multiple of 7 as well and that its 
limit is zero. Thus f(n7) = 0 for all odd n (also for even n in this case). The 
graph of f is shown below: 


The dots are on the z-axis at +7, +37, +57, etc. 
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Recall that a function f defined on (—a,a) is said to be an even function 
if f(—x) = f(x) for all x in this interval. It is easy to see that, in this case, 


c f(z)dz = af f(x)dz 


A function g(z) on (—a,a) is said to be an odd function if g(—z) = —g(z) 
for all x in this interval. Again one can easily see that, in this case, 


J o(2dz =0 
—a 
Now when f is an even function on (—7,7), then f(z) cosnz is also an 
even function for all n = 0,1,2,.... Hence. 
Qn = : f(x) cosnz dx = = [ f(z)cosnxdz, for each n 
ae T Jo 
Also, because f(z) sin nz is an odd function for all n = 1,2,..., 
b, = ~| f(z)sinnzdz =0_ for each n 
Similarly, when g(x) is an odd function on (—7, 7) we have 
1 T 
ig < g(x) cosnzadr=0, n=0,1,2,:-- 
and 


b,, = ~| g(z) sinnz dx = = [ g(x) sin nz dz, n= 1,2,:--- 
0 


a —-t 


Examples 1 and 2 illustrate these results. The function f(z) = z? is even 
on (—7.7). We have seen that its Fourier series contains only even functions, 
every 6, = 0. and that 


1 n? a 2(-1)" 
50 = > s0 a =2 (=) and an =2| 72 . n= 12.45 


The function f(r) = z is an odd function on (—7,7). Its Fourier series 
n+t 
contains only odd functions, each a, = 0 and 6, = gu n=1,2,---. 


Exercises 1 


1. (a) Show that the Fourier series of f(z) = z on (~7,7) is 
oo 
(-1)"*} 
2 > _ sin nz 


n=l 
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(b) Show that the Fourier series of f(z) = e7 on (—7,7) is 


2 sink l —1)" 
sinh 7 pe ) 


(cosnzr — nsinnz) 
n 


T 2 


(c) Let f(z) = 1 ec Bare and let f(z) = 2 when 0 < x < 7. 


Show that f ~ ; + — > oe cr 1) sin nz. 


(d) Let g(x) = 0 when —7 < x < 0, and let g(x) = sinz when0 < zr < 7. 


1 1 2 2 
Show that g ~ — —+ 5 sina — >} 7 —. 


2. (a) Let f be an even function on ae and let g be an odd function 
on this interval. Show that 


[nerds =2 f° pteaz and [sade =0 


(b) Show that f - g(x) = f(x)g(zx) is an odd function. 


3. Give f defined on R set fo(x) = f=) f(-2) and set f-(r) = Lz)+f(-=) 
Observe that f(z) = f-(z) + fo(zx). 


(a) Show that fo is an odd function and f, is an even function. 
(b) Compute f, and fg for f(r) = e*. 


2. Functions on Other Intervals 


Suppose now that our function is given only on (—a,a). By analogy with what 
we did in the last section, we would expect that the Fourier series of the given 
function should converge to a 2a-periodic function. Now the period of sin bz is 
ar This is 2a when b = z/a. So we would expect our series to be of the form 


500+) [2x e08 (= ) +b, sin (=) (1) 


Given a function f on (—a,a), we define the Fourier coefficients of f to be 


the numbers 
--/ f(z) cos (222) ar, n=0.1.2.+": 
Qa Ja a 


and 


n= = | f(z) sin (==) ar, n= 1,2.3,-:: 
Qa Ja a 
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Having found these numbers we define the Fourier series of f to be (1). 
As an example, consider f(z) = z? on (—a,a). We find that 


a2 4a? — (-1)" NTL 
fal~ s+ Lae oT) 


When a = z we obtain the series found in the last section (Example 1). 
It is clear that this series converges uniformly over R to a function f that is 
continuous and 2a-periodic (i.e., 2a is a period of this function). 

Recall that Fejér’s theorem tells us that any function in C(T) is the uniform 
limit of the Cesaro means of its Fourier series. Given a continuous 2a-periodic 
function f we may regard f as a function on the circle with radius a/7 centered 
at the origin. A change of variable in Fejér’s theorem tells us that any such f is 
the uniform limit of the Cesaro means of its Fourier series (i.e., the series (1)). 


Very often the functions arising in applications are defined only on an 
interval of the form (0,a). Given f(z) on this interval there are two, among 
many, natural ways to extend it to (—a,a). We may define 


f(-z) for -a<2z<0 


fda)=} for0<z<a 


This is called the even extension of f. The Fourier series of this function is 
called the Fourier cosine series for f. We have 


f(z) ~ 509 + SS acs (=) 
n=1 


a 
where 


on, == [ f(z) cos (==) de, n=0,1,2,--- 
a Jo a 


If this series converges, its limit will be a 2a-periodic function, the periodic 
extension of f,(z). 
We may also define 


x for0<2r<a 
fo(z) = ie) 
—f(-z) for-a<z<0 
This is called the odd extension of f and its Fourier series 


foe(z) ~ by sin (=) 


n=l 
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where 
2 f° _ (Nur 
n= = | fo(x) sin (“~) de, n=1,2,3,--: 


is called the Fourier sine series for f. Again, if the series converges, its limit 
will be the 2a-periodic extension of f(z). 

These series are called the half-range expansions of f(z). Let us give some 
examples. 


Ezample 1 


Let f(x) = x on (0,7). Then fo(x) = x on (—7,7); technically, fg is not 
defined at 0, but this does not effect its Fourier series. That series is the one 
found earlier, specifically, 


——— 


fo(z) ~ ay cre (- = sin nz 


The even extension of f(z), f-(x), coincides with |z| (except, again. at 
x = 0). Here we have 


fe (x) = -$5) ar ma 


Ezample 2 
Suppose we modify Example 1 by taking g(z) = x on (0,1). Then ge(x) = x 
on (—1,1) and 


2(-1)"*! 


: 2 
b, =2 ] xsinnazdr = ——cosnrz = 
rt) nr na 


Thus 
(- 13 
go(z) = LS . sin n7Tx 


n=) 


The even extension, g-(z), coincides with |z| on (~1,1), giving us 


1 
ag = 2 | zdzr=1 
0 


: - —2(1 - (-1)” 
an = 2 | reosnnzde = —>* (1 —cosnn) = ULI") 
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and 


COS NTI 


law Ol (=1)" 
s(2)~3-D Maa 


Exercises 2 


1. (a) Let f(z) = —1 when -a < x < 0, f(z) = 1 when 0 < z < a. Show 


that 
4— 1 _ [(Qn-1)xz 
t~ 23 (g=7) | a 


(b) Let f(z) = 2+ 2? for -1 < z < 1. Show that 


2 
fr +2 +— Lt 1)” Ee cos naz — + sinn| 


n=1 


(c) Let f(z) = 0 when -2 < xz <1, f(z) = 1 when 1 < z < 2. Show 


that 
1 _ LSS sin 00s 2 + (cosmm — cos %) sin 2 
4 an 2 2 2 2 
2. Here we compute some half-range expansions: 
(a) If f(z) = sinz when 0 < z < 7, then fo(z) = sinz and 
2 4 cos2nz 
felt)~ 2a 1 
(b) If f(x) = cosz when 0 < x < 7, then 
8 — nsin2nz 
fo(z) ~ = ys hr? - 1° f-(x) = cosz 
2— nsin Nx 
(c) If f(z) = e* on (0,7), then fo(z) ~ = > [1 — (-1)"e"] 
e* — COS NI 
and fe(z) ~ n241° 


(d) If f(z) = a — =z on (0,7), then 


fol) ~ D3 SINT | fela) ~ +23 aoe 
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3. Functions With Special Properties 


The functions that arise in the applications of Fourier analysis to engineering 
are, as illustrated by the examples given above, usually continuous except at 
a finite number of points. Moreover, at these exceptional points the functions 
often have one-sided derivatives (defined below). Here we shall investigate the 
relevant properties of functions of this type. 

Let a.b be real numbers with a < 6. A function f that is defined and 
continuous on (a,b) and for which the limits, lim, f(a+h) and tim, f(b-h), 
both exist (i.e., they are real numbers) can obviously be extended to a function 
f that is defined and continuous on [a, 6]; just take f(z) = f(z) in (a,b) and 
set f(a) equal to the first of these limits and f(b) equal to the second one. 
Because f is continuous on [a,b], it can be approximated by a polynomial on 
that interval (Chapter One, Section 7, Corollary 3). So it follows that f is 
bounded on (a,b) and, given € > 0, there is a polynomial P,(x) such that 


|f(z) — Pe(z)| <€ 


for all z € (a,b). 
We can characterize these functions another way. Recall (Chapter Zero, 
Section 5, Definition 3): 


Definition 1. The function g(z) defined on (a,b) is uniformly continuous on 
that interval if. given ¢ > 0, there is ad > O such that |g(x)—g(y)| < €¢ whenever 
z,y € (a,b) and |x — y| < 6. 


It is easy to see that if g is uniformly continuous on (a,b), tim, g(a +h) 
and lim g(b — h) both exist, and conversely, if f is continuous on (a,6) and 
these one-sided limits exist for f, then f is uniformly continuous on (a,b) (see 


Exercises 3, problem 4). 


Definition 2. A function f defined on (a,6) is said to be piecewise contin- 
uous, or sectionally continuous, on this interval if there is a finite (perhaps 
empty) set {r;}"_, € (a,6) such that f is discontinuous at each x; and f is 


uniformly continuous on each of the intervals (z,.27;41), j = 0,....2 where 
Lo = 4, In41 = b. We shall call {x;}%*5 the subdivision of f. 


Given any y € (a,b) we set 
fly +0) = lim, f(y + A) 
and 


f(y -0) = lim, fly -h) 
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When f is piecewise continuous on (a, b) these limits always exist. For any 
such function the numbers 


f(a+0) = lim, f(a + h) 
and 
f(b-0) = lim, f(b— h) 
also exist. 
Theorem 1. If f is piecewise continuous on (a, 6), then Jim, / f(z) sin Az dz = 


b 
0 and tim, f f(z) cos de dr = 0. 
Aco J, 


Proof. It is clearly sufficient to prove this result for a function f that is uniformly 
continuous on (a,b). Now given € > 0 we must show that 


<€é 


[ Steyeosrzes 


for all \ > A; i.e., given € > 0 we must show that we can find A such that this 
inequality is true. Now we may choose a polynomial P.(z) such that 


b 
E 
[ te) - Pela)las < § 
for all we need do is choose P,(z) so that 


E 
f(z) — Pe(z)| < 2(b — a) 


for all z € (a,b). Clearly then 


b b b 
J s(2)c08 kzde < | f(z) 00s Ax de — [ P,(x) cos Az dz| + 


b 
| P,(z) con de 


b 
< [ is(2) - Pe(z)|loos Azldz + 


b 
| P(x) cos Ax ty 
a 


b 
=.2 : P,(z) cos Ax dr 


Norm 
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In this last integral we may integrate by parts. We find 


[ Pelz) 08 Ax dz = = — 5 | PKe)sinarde 
and so 
a ae ; b 
[/ Pez) cos rede < Pelt) sin 00 ~ Relea 5 [ PA(a)sin za 
a a 


We can choose A;, so that the first term is less than ¢/4 when \ > A). The 
term containing the integral can be estimated as follows: 


(b — a) 
d 


b 
/ P.(z) sin Ax dx 
a 


1 
5 < sup |P:(z)| 
a<xz<b 


where we recall that P.(z) is a polynomial; hence, P(x) is uniformly continuous 
on (a, 6) and, in particular, bounded on that interval. Thus we can find A2 such 
that this integral is less than ¢/4 whenever \ > Ag. Clearly then 


<E 


b 
/ f(x) cos Az dr 


whenever \ > max{Aj, Az}, and the proof is complete. 
A similar argument shows that 


b 
tim f f(z)sin dvd = 0 
Ac Ja 


Corollary 1. The Fourier coefficients of a piecewise continuous function tend 
to zero as n — oo. This generalizes Riemann’s theorem (Chapter One, Section 
5, Theorem 1). 
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We recall that if zo € (a,b), we say that f is differentiable at this point 
provided 


lim f(zo +h) — f(zo) 
h—0 h 


exists. We denote this limit by f’(z0). Now 
faa + hy = |A0F A= M0) 5 4 F029) 


and so, when f’(o9) exists, lim f(z0 +h) = f(zo); i.e., differentiability of f 
at ro implies the continuity of f at that point. We shall now define the one- 
sided derivatives of f obtaining a useful generalization of the derivative that 
can exist even at discontinuities of f. We shall use the notation introduced just 
after Definition 2. 


Definition 3. Let f be piecewise continuous on (a,b), and let ro € (a,b). 
Then the right-hand derivative of f at ro, f(Zo), is defined to be 


lim 270 +4) — f(zo + 0) 

h—0+ h 
provided that this limit exists. We define the left-hand derivative of f at 
Zo; f' (20), to be 


lim f(z0 — h) — f(z0 — 9) 
h—0+ h 


provided that this limit exists. 


It is clear that when f has a derivative at zo, then both its right- and 
left-hand derivatives exist and both are equal to f'(zo). The function f(z) = 
2r, r <1, f(z) = 1-12, 1 < 2, however, is discontinuous, hence nondifferen- 
tiable, at zr = 1, but f(1) and f/(1) both exist. The first of these is —1; the 
second is 2. 


Lemma 1. If the two functions f,g both have a right-hand derivative (or a 
left-hand derivative) at a point zo, then also so does f(z) - g(z). 


Proof. For any h > 0 we may write 


f(z0 + h)g(zo + h) — f (Zo + 0)9(z0 + 0) 
h 
g(Zo + h) — g(z0 + 0) 
h 


(zo + h) — f(zo + 0) 


=f (xo +h) + 9(Zo + 0) |! h 


and the result follows from this. 
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Not surprisingly, we can define the right-hand derivative at a and the left- 
hand derivative at b for a piecewise continuous function f on (a,b). They are 


fi.(a) = tim, f(a+h) : f(a +0) 


and 


. f(b—h) — f(b- 0) 
‘(b) = i a eS 
f- (6) i, h 

We can give conditions on f that imply the existence of these derivatives. 


Lemma 2. Let f be uniformly continuous on (a, 6). Suppose that f’(z) exists 
at every point of this interval and that f’ is uniformly continuous here. Then the 
right- and left-hand derivatives of f at a and b, respectively, exist. Furthermore, 


f.(a) = f'(a +0) and f7(b) = f’(b— 0). 


Proof. We may define a continuous function f on [a,b] by setting f(r) = f(x) 
when x € (a,b), f(a) = f(a +0). f(b) = f(b—0). Clearly, f has a derivative 
at each point of (a,b); in fact. f’(z) = f’(x) and every point of (a,b). By the 
inean value theorem (Chapter Zero, Section 5, Theorein 5) we inay write 


f(a+h) — f(a) _ 


7 f'(a + 0h) 


where h > 0, and 6. 0 < 8 < 1, depends on h. But this is just flash) fas) = 
f'(a + 9h) and, letting h — 0* proves the result at z = a. The proof for the 
point z = 6 is similar. 


Corollary 1. If f is piecewise continuous on (a,b) with subdivision {z, yoo 


(see Definition 2), and if f’ is piecewise continuous on (a,b) with the same 
subdivision, then the one-sided derivatives of f exist at every point of [a, }}. 


Exercises 3 


* 1. Show that f(r) = Tania has both a left- and a right-hand derivative at 
x = 0. What is lim f(z)? 


2. Let f(x) =z on (—7,7). Let f(x + 27) = f(z). Compute: 
(a) f(m—0); = (b) f(m +0): = (c) f'(m-0); — (d) "(7 + 0). 
3. Let f(x) =zifO<2< §, f(z) =a —-2 if § <r < 7. Compute: 


(a) f(¥- 0): = (b) F(Z +0); (ec) F(Z - 0): = (d) f'(§ +O). 
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*4. Some authors define f to be piecewise continuous on (a,b) if there is a 
finite (perhaps empty) set {z;}7_, such that f is continuous on (xj, 2541), 


7=0,1,...,n, where a = 2p and b = 2,4), and bo f(z), lim f(z) 
z—2z, 


exist for each j = 1,2,...n, lim. f(x) exists and ‘lim f(z) exists. Show 
z~—a —oO~ 


that this is equivalent to our Definition 2 as follows. Clearly, it suffices to 
prove the result when {z;} = ¢. 


(a) If f is continuous on (a,b) and lim f(z), lim f(z) both exist, 
define f on [a,b] by letting f(z) = f(z) when z € (a,)), f(a) = 
jim, f(z), f(b) = Jim f(z). Then f is continuous on the closed 


hounded interval {a, b]. Now use Theorem 3 of Chapter Zero, Section 
5 to show that f is uniformly continuous on (a, }). 


(b) Now suppose that f is uniformly continuous on (a, 6). We must show 
that lim f(z) and lim f(z) both exist. 
z—a z—O~ 


i. If {z,} C (a,b) is a Cauchy sequence (Chapter Zero, Exercises 
5, Definition 5), show that {f(z,)} is a Cauchy sequence. 

ii. Choose {r,} € (a,6) such that jim In = a and let a = 
tim f (zn). Given € > 0, find 6 > 0 such that |a— f(a+h)| <e 
whenever 0 < h < 6. Thus lim, f(z) = a. The limit of f at 6 
can be treated similarly. — 


4. Pointwise Convergence of the Fourier Series 


In proving Fejér’s theorem we made use of an integral representation of the 
Cesaro means of the Fourier series. This involved a sequence of functions that 
we called Fejér’s kernel. Recall 


2n 
om(eif)= 5 [| Km(e—v)s(u)dy 


where 


Pane eee > |e a 1 4) 


l1—cosz m \ sin 5z 


Here we shall derive an integral representation for the partial sums of a 
Fourier series. This, too, involves a sequence of functions that we shall call 
Dirichlet’s kernel. We find that 


sn(ai f= =f f(t) Dalt ~ 2)at 
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where 


sin(n + 4)z 
Dee panda 


We begin with a lemma due to Lagrange. 


n e PRY’) 
Lemma l. 25 mig ae 
sin 59 


k=1 


n n 
Proof. 2(cos 6 + cos 26 + ---+cosn@) = y- etkO 5 »- e'*9 We may sum each 
k=1 k=1 
of these finite geometric series to get 


n e'9 (1 x ein?) e~9(1 a e— ind) 
re ee eee ah 


— 918 _— ot 
ram l-—e l-e 


e'F(1 = ein) (1 _ e~ ind) 
1 — e'9 e'9(] = e~'9) 


1 — e'9 1 — e9 1 — e? 
In this last quotient we multiply numerator and denominator by e~*8/? to get 


n eif — ei(n+4)e _ e7'f + e7i(n+4)@ 
2 ) cos ké. = —————_,. 
k=1 ett ~ ef 


(e~ 19/2 _ @i8/2) — e-i(n+4)8 _ eiln+h)e 


e7'§ ~ ef en _ ef 


ae cos(n + 3)6 —isin(n + $)@ — {cos(n + $)0 + isin(n + +)6} 
cos § — isin § — (cos 5 + isin 5) 


_ 2isin(n+3)0 ; sin(n + 4)6 


- 2 @ oe | 
—2isin 5 sin 59 


and the proof is complete. 
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Now given a function f on (—7, 7) the nth partial sum of the Fourier series 
of f can be transformed into an integral as follows: 


v 


8n(x; f) = 5 + 2 (0% coskzx + by sin kr) 
= [ f(t) Jat + — Sitoaske f” f(t) cos kt dt 


+ sinks | f(t) sin At dt) 


us 


1. loc 
if" 1043 + Soomute— 2) ba 


Using Lemma 1 inside the curly brackets we obtain 


1 f* loc 
— / f(t)4¢=+ S (cos kx cos kt + sin kx sin kt) 2 dt 
7 2 k=} 


su(r: f) -if f(t pe a (1) 


= 2sin(3(t — z)| 


We shall use (1) to investigate the convergence of the Fourier series of f. 
There is no loss in generality in assuming that f is 27-periodic, and we shall 
assume that from now on (Section 1, just after Definition 1). 

In (1) we set u = ¢t — x to obtain 


|; “for sin[(n + 4)u 
sale =a f futz) eta 
ian Ee sin 5u 


In here we replace a ane by D,(u). The reader can check that the func- 


tion D,,(u) is 27-periodic. Thus (Chapter One, Exercises 2, problem 3) our last 
expression for s,,(z; f) becomes 


Sy(xz: f) = [ f(ut+2z)Dy(u)du 


Lemma 2. For each fixed n the function D,(u) is even (Section 1, just after 
Example 2) and, furthermore, 


/ : D,,(u)du = 2 / : D,,(u)du = 
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Proof. We shall use Lemma 1. We have 
® ae 1 
D,(u)du = | sinl(n + 3)u] 


= mJ_, 2sindu 
: ico? "ead 
ae ae kay" ~* 


=; [5@n) 
=1 


Theorem 1. Let f be a 27-periodic function that is piecewise continuous on 
(—z,7). Then the Fourier series of f converges to 


l 
slf(z +0) + f(x - 0)] 
at every point z where f has both a left- and a right-hand derivative. 


Proof. Let xz be a point where f has both a left- and a right-hand derivative. 
There is no loss in generality in assuming that z € (—7, 7). We must show that 
Yi - 0 

lim sa(z;f) = lim | f(u+2)Dp(u)du = fet oy+ fen) 


and to do this it suffices to show that 


tin a ft f(u+z)D,(u)du = f(z =~ dl 
tin of f(u+z)D,(u)du = 0) 


By lemma 2 the first of these is equivalent to showing that 
lim [ [f(u +x) — f(x +0)]D,(u)du = 0 
n--Oo 0 
This integral can be written as follows: 


sin[n + 5 2 2 ul} 


[wrens (z + 0)]| ———<—du = 


27 sin uu 


=f <= sin|(n + 5 )uldu 
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By Theorem 1 of Section 3 this tends to zero as n — oo, provided the function 
in the square brackets is piecewise continuous on (0,7). The only potential 
difficulty is that the limit, as u — 0*, might not exist. This, however, is not 
really a problem because 


lim f(u+z) - f(z +0) 


u—0+ sin du 


= Jig, [Sete = He +9) (4) 


2 lim fete) f(z +0) 


u—0+ 


and this exists because f has a right-hand derivative at z. 
The fact that 


0 
jim, [ [p(w +2) - f(2 - 0)|Da(u)du = 0 


can be proved in a similar way. 


Remarks. 


(1) At any point zo where the function f is continuous and has one-sided 
derivatives, the series converges to f(zo). This is true, in particular, at 
any point where f(z) is differentiable. 


(2) An analogous theorem is true for any piecewise continuous, 2a-periodic 
function (see Section 2 for the definition of the Fourier series of such a 
function). 


(3) If f is defined only on (—7,7), then the theorem applies to the periodic 
extension of f to all of R. Hence, if f is piecewise continuous, its Fourier 
series converges to 3[f(z+0) + f(z —0)] at each interior point (i.e., point 
of (—7,7)) where both one-sided derivatives exist. At both end points, 
x = 7 and z = —7; however, the series converges to 4[f(m—0)+ f(-7+ 
0)] provided f has a right-hand derivative at s = —7 and a left-hand 
derivative at x = 7. Thus if the series is to converge to f(—7 +0) at x = 


—n or to f(7 —0) when z = 7, it is necessary that f(—7 +0) = f(a —0). 


ww” 
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Let us return now to equation (1). Recall 


" f(t)sin[(n + 5)(t - 2) 


= 2 sin[3(t — z)| a (1) 


8,(Z; fy=- 


=i f f(t)sin(n + 3)u a 


oe sin $u 


° f(z+u)sin(n + 2)4 a, 
= 5, sin au 


pe + u)sin(n + 5 guy 
x sin du 


where we have set u = t — x. In the first integral on the right we set v = —u to 
obtain 


f(x —v) aul + DY fe f(z —v) ane + 43)v 
Qn Lf sin }v eer y Ca -xf[ sin $v = agate, 
Hence (1) becomes 
én (x: f) = * f(x +u) a + 2)du i 
sin 5u 
f(z —-v) ae f(z — v)sin(n + 3)v , 
+ oF oy sin = Su a 
=x | ST 2 pla + w) + He wh 


We can put this into a more illuminating form by first noting that when 
f(x) =1 for all z we have ap = 2 and s,(z; f) = 1 for all n. Thus 


1 f* sin(n + 3)u 


l= 
27 Jo — Sin du 


(2du) 


Multiplying this by s and subtracting from s,(z; f) gives us 


Bena [ snl )E yeaa je aa) aah 


yrs sin 5u 


Thus a necessary and sufficient condition that s,,(z; f) converge to s is that 
this last integral tends to zero as n — oo. There are tests for convergence that 
are based on giving conditions on f for this to be true (see Notes). 
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Exercises 4 


1. In each of the following cases discuss the convergence of the real Fourier 
series of the function given. 


(a) f(r) =27, -™<aK<t 
(b) f(z) =e", -7<xr<t 
(c) f(z) =1, when -—7 < «<0 and f(z) =2whenO<2<7 
(d) f(z) =0, when —7 < x <0 and f(z) =sinz when0<2<7 
2. The nth partial sum of the Fourier series of a function f(t) oscillates near 


a discontinuity of this function. This is known as the Gibbs phenomenon 
or the Gibbs effect. We give an illustration of this in the following exercise. 


(a) Let f(t) =a-—tforO<t<x7z, f(t) = —a — ¢ for —17 <t <0. Show 
sinnt 


that the Fourier series for f is 2 > 


m=) 


sin nt 


(b) Let us set f(t) = 2 > s 
of fx for large k near the we t = 0 of f. Note first that 


fi(t) = stn(htigt by Lemma 1. 


(c) The first critical point (i.e., the first point where f’ is zero) of f, to 
' sin(k + 4)¢ 
sin 5t 


— (x —t) and investigate the behavior 


as | 
SIN >t 


the right of f = Oist, = -¢y. Show that f(t) = / dt— 
2 0 


Coral 
ae 


(d) In the integral in part (c) set y = (K+ 4)t and use the fact that 


wt 4 Hl 
lin —— Ne = 1 to show that lim f,(t,) = 2 | se dy — 7. 


yes 

) Numerical evaluation of this last quantity shows it to be ~ 0.562. 
Now as k — 20, th — 0, but. as we have shown, the difference be- 
tween f(t) and its Ath partial sum (i.e., f;) does not tend uniformly 
to zero. However large k is, there are points arbitrarily close to zero 
at which f;, is near 0.562. 


-_a-,, 
cee) 
~~’ 
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Notes 


The work of Fourier is discussed in the book by Jeffery listed in the Notes to 
Chapter One. The tests mentioned at the end of Section 4 can be found in 
The Theory of Functions, by E. C. Titchmarsh (Oxford University Press, 2nd 
edition, London, 1939), see pp. 406-410. Further discussions of the engineering 
applications of Fourier series can be found in Advanced Engineering Mathemat- 
ics, by Erwin Kreyszig (John Wiley & Sons, Inc., 8th edition, New York, 1999), 
see pp. 925-648. 


Chapter Three 


Fourier Series in Hilbert 
Space 


There is another way to look at Fourier series. Before we can present this more 
geometric point of view, we must first review some linear algebra. We begin 
informally and leave a more systematic discussion to the next section. 

We recall that the vector space R” has an inner (sometimes called “dot” 
or “scalar”) product. For z = (21,...,2n) , ¥ = (Yi,--+.Yn) we have 


<Z,y>= ETT (1) 


In the space C” we must modify (1) a little bit. For Z1= (2),---,2n) and 
wi = (wz,...,Wn) we have 


<z,w >=) 20; (2) 


where the bar denotes complex conjugation (Chapter Zero, Exercises 4, problein 
1). 

If we set €) = (1,0,...,0), €2 = (0,1,0,...,0), ... @n = (0,0,...,1), then 
clearly < e;, €, >= 0 when 7 # k, and this product is 1 when j = k. Also, for 
any vector u, 


n 
u=) <u,e;>e; (3) 
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With these results in mind, let us turn now to C(T). Equation (2) leads us 
to define 


<fig>= a f (t)g(t)dt for any f, gin C(T) 


Note that for any fixed n € Z, 


< eit ei >= 2a 


ent ent eimt 
and so the set 


n=m v2n vin’ v2n 


Equation (3) leads us to ask if, for f € C(T), we may write: 


end eint eint 
t= > Ne) 


n=-00 


Ine Z satisfies ( ) = 0 if m # n and is 1 if 


oo 


> i <f eint s pint 
? 


27 
n=-—-6o 


5 (EJ sioetar) 


n=-—0o 


So we are asking if f can be represented by its Fourier series, but we have not 
specified in just what sense the series is to “represent” the function. Some kind 
of convergence must be involved, but the convergence we have in mind here is 
neither pointwise nor uniform. It is a notion of convergence induced on C(T) 
by its inner product. This will be made clear below. 


1. Normed Vector Spaces 


To proceed with our new way of looking at Fourier series, we must first say 
something about infinite dimensional vector spaces. The terminology of linear 
algebra is, of course. needed here. A brief discussion of this terminology is given 
in Appendix A. 

Let us begin by recalling some conventions. If D is a nonempty subset of K 
and f,g are maps from D into K, then f +g is the function defined as follows: 
(f +g)(t) = f(t) + g(t) for all teD. Also, when AeK, the function Af is defined 
by (Af)(t) = Af(t) for all teD. When we consider a vector space of functions 
defined on D (i.e., a set of functions that has the properties listed in Definition 
1 of Appendix A), then the values of these functions determine our scalars. So 
if all of our functions are real-valued, our scalars are real numbers, and when 
all of our functions are complex-valued, our scalars are complex numbers. 
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Recall that, in very elementary physics, a vector is defined to be a quantity 
having both length and direction. In mathematics we first define a vector space 
and then say that the elements of that space are to be called vectors. We know 
nothing about them except that they obey certain axioms. This is the basis 
for the often repeated remark that mathematics is the subject in which we 
don’t know what we are talking about. Many of the vector spaces that arise 
in analysis, however, have a function defined on them that enables us to assign 
a “length” to each element of the space. Such a function is called a “norm.” 
The formal definition is this: 


Definition 1. Let E be a vector space over K. A nonnegative real-valued 
function p on E is said to be a norm on E if 


(a) p(x + y) < p(x) + p(y) for all z, y in E; 
(b) p(Az) = JA| p(z) for all A in K and all reE; 
(c) p(x) = 0 if, and only if, x is the zero vector. 


When p is a norm on E, it is customary to denote, for each reE, the 


number p(z) by ||z||. A vector space on which a norm is defined is called a 
normed space. If we want to emphasize the norm, say || - ||, on E we speak of 
the normed space (E£, || - ||). 

Let us look at a few examples. 


(i) For any fixed neN the vector space R” consists of all ordered n-tiples of 
real numbers. So if reR", then 2 = (1),...,2,). We define || to be 


n 
2 
2 
j=l 


Because Ax = (Az), .-.,AZ_) we see that 


n 4 n 4 nn 4 
nate [0%] - pS] rife =|] (Izlle 
j=l 


j=l j=) 


Also, ||z {lz = 0 if, and only if, 
> x‘ =0 
j=1 
Clearly, this will happen when, and only when, z; = 0 for each j. So the 
function || - ||2 has properties (b) and (c) of Definition 1. Property (a), the 
so-called triangle inequality, is harder to prove. We present the proof after we 
discuss inner products. 
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(ii) We can obtain an important generalization of R" by considering all “square 
summable” sequences of real numbers, i.e., all functions z : N — R such 
that 

XS 
(x(n))? < cc 
1 


n= 


Writing z(n) = 2, for each n, we are considering all sequences I= 
(x1, 22,...) such that 


The set of all such sequences is denoted by @7(N) (read “é two of N”). We 
shall see, once we discuss inner products, that £7(N) is a vector space and that 
the function 


oc 3 
om 2 
I| x ll2 = » “| 
n=1 
is a norm on this space. 


(iii) There are natural analogues of (i) and (ii) for the vector space of or- 
dered n-tuples of complex numbers, €”, and the vector space of square 
summable sequences of complex numbers. The latter is denoted by £2(N) 
or, when there is no possibility of confusion, by @7(N). In the first of these, 


however, we must define ||z ||2 = to be 


and in the second 


20 } 
lz lle = | | Zn | (see Exercises 1, problem 2). 
1 


Ns 


(iv) Of special importance for us is the vector space @'(Z) (read “@ one of Z”). 
It consists of all functions a : Z — C such that 


oS 


> | a(n) |< oo 


n=—oOo 
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Observe that when a, b are in ¢'(Z), then a + 6 is also in this set: for 


>= | (a + 6)(n) |= S© | a(n) + b(n) |< D> {| a(n) | + | b(n) |} 


= > J a(n) | + >_ | O(n) | 


and these last two sums are finite. It is also clear that when aeé!(Z), so does 
Aa for any scalar 4. One can now check that our set has all of the properties 
listed in Definition 1 of Appendix A, and so ¢'(Z) is a vector space. 


For each aeé!(Z) we define ||a||, to be 


S~ | a(n) | 


This is a nonnegative real-valued function and |ja||,; = 0 if, and only if, 
a(n) = 0 for all neZ; i.e., if, and only if, a is the zero vector in @'(Z). Also, 
|Aa@||, =| A | |lal], for any a and any scalar \. The triangle inequality (property 
(a) in Definition 1) was proved in the paragraph just above. Thus (¢'(Z), ||-|l1) 
is a normed space. 

We shall see more examples of normed spaces in the later sections. 


Definition 2. Let (E, ||-||) be a normed space and let roeE. A subset S of E 


is said to be a neighborhood of Zo if for some € > 0 we have {reE \\lz-Zoll < 
e} ¢ S. A subset of E is said to be an open set if it is a neighborhood of each 
of its points, and a subset of E is said to be a closed set if its compliment, in 
E, is an open set. 


Given z,¢€E and any € > 0, the set O, (Zo) = {zeE | ||z - - Zoll <  E} is 
called the open ball of radius ¢ centered at Zo, and the set Be (zo) = {reE | 


\|z — Zoll < €} is called the closed ball of radius € centered at Zo. Observe that 
an open ball is an open set and a closed ball is a closed set (see Exercises 1, 


problem 3). For any A C E and any scalar A, let AA = {Az | reA}. We have 
B. (0) ={zeE| [Z| <e}={(ez) [Iz <1}=eH,(0) (1) 


For any set BC Elet A+ B={z+y | zeA and yeB} 
Then 


Be (Zo) = {ek |||z - Zoll <e} (2) 
= {yeE | y = Zo + ezwith Fal < 1} = {zo} +€3;(0) 
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because 


{Zo} + eh (0) ={Zo+ez | Iz <1} (3) 


Hence we see that if we call 3,(0) the unit ball of E, any closed ball 
centered at zero is a scalar times the unit ball and any closed ball with center 
Zo is just a translate to Zo of a closed ball with center at zero. 

Similar remarks apply to open balls. 


Exercises ] 


#1. Let (E,||- ||) be a normed space and let z, y be elements of E. Show 
that | {|r| - lyll ls Ilz - yl. 
Hint: Writer =z-—yt+yandy=y-zt+7c. 

* 2. Find a vector (z,,22)eC? such that 2? + 22 = 0 and | z, |? + | 22 |? 0. 


*3. (a) Show that an open ball, say O-(Zo); is an open set. Given y €O, (Zo} ; 
find 6 > 0 such that Os (y ) CO, (9). 


(b) Show that a closed ball, say 3-(z0), is a closed set. Given y ¢ 3e(Zo), 
find 6 > 0 such that Os (9) M B¢(z0) = 


(c) Show that the union of any collection of open sets is an open set, 


and the intersection of any finite collection of open sets is an open 
set. 


(d) Show that the intersection of any family of closed sets is a closed set, 


and that the union of any finite collection of closed sets is a closed 
set. 


4. In R" the vectors e, = (1,0,0,...,0), e2 = (0,1,0,...,0), e€3 = (0,0,1, 
Oise) sucks en= (0, ...,0,1) are linearly independent. 
An infinite subset of a vector space is said to be linearly independent if 
each of its finite subsets is linearly independent. Find an infinite linearly 


independent subset of 2(N) and find such a set in @1(Z). What does this 
tell you about the dimension of these vector spaces? 


2. Convergence in Normed Spaces 


The presence of a norm on a vector space enables us to define convergence for 
sequences in that space. Once we have this concept, we shall be able to discuss 
convergent series in, and continuous functions on, the space. 
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_—. \o 
Definition 1. Let (E, ||-||) be a normed space and let { Zn} 


be a sequence 
n=) 


of vectors in E. We shall say that this sequence is convergent to the vector 
ZoeE, and we shall write Baa In = Zo, if for any given € > O there is a 


natural number N such that Zn - Zoll < € whenever n > N. 


Note that the sequence {zn} C E converges to ToeE if, and only if, the 


sequence {zn — zo} converges to the zero vector of E. As a simple application 


of this idea, let us use convergent sequences to characterize the closed subsets 


Lemma 1. A nonempty subset C of E is closed in (E, || - ||) if, and only if, 
every sequence of points of C’ that is convergent converges to a point in C. 


Proof. Suppose first that C is closed and that {zn} C C is convergent to a 


point yeE. If y ¢ C, then y is in the complement of C, E\C. which is 
an open set. Hence, for some «€ > 0 we must have O, (v) C E\C (Section 


1, Definition 2). Because {zn} converges to y, there must be an integer N 


such that £,€O, (v) whenever n > N. Thus we have reached a contradiction 
because, when n > N, z,, is in both C and E\C. 


Now suppose that C’ is not a closed set. We shall construct a sequence 
in} C C that converges to a point y that is not in C. Because C is not 


closed, E\C is not open, and hence, in particular, it cannot be empty. Thus 
there must be a point yeE\C such that every neighborhood of y intersects 
C; otherwise E\C would be a neighborhood of each of its points. So for each 


n= 1,2,... we can find T,€O1 (v) NC. Clearly, {zn} is a sequence in C whose 
limit is y ¢ C. 

The Fourier transform is of fundamental importance in harmonic analysis. 
It is a linear map between two normed spaces. Whenever we speak of a linear 
map between normed spaces E), E2 we shall always assume that these spaces 


are defined over the same field K (i.e., both are vector spaces over R or both 
are vector spaces over C). Recall that a map T : E, — Ez is said to be linear if 


(i) T (= + v) =T(z) +T(y) for all z, y in E, and 
(ii) T(z) = \T (z) for all zeE, and all scalars 2. 


Observe that T must map thie zero vector in E onto the zero vector in E>. 
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Definition 2. Let (£;,||- ||;) and (£2, || - ||2) be two normed spaces and let 
T : E, — E2 bea linear map. We shall say that T is a continuous linear map 


if the sequence {T (zn)} converges to the vector T (Zo) for || - ||2 whenever 


the sequence {z * C E, converges to the vector Zo for || - ||. 


Our first result gives some especially useful characterizations of continuous 
linear maps. It is because of the first two of these that continuous linear maps 
are often called “bounded” linear mappings. 


Theorem 1. Let (£), || - ||1) and (£2, || - ||2) be two normed spaces and let 
T : E; — Ez be a linear mapping. Then the following are equivalent: 


(i) There is a real number M such that ||T (z) lle < Mllzlli for all reE}; 


(ii) There is a real number M such that ||T (z) ll2 <M whenever reE) and 
Izllhi < 1; 


(iii) T is a continuous linear map. 


Proof. It is clear that (i) implies (ii). eas that T has property (ii). Then 
) Ma < M. Because T is linear we 


given x zreEi, iz =) 


see that ||T (2) te lz < M{|zll, for any nonzero vector zeE,. But any linear 


map takes the zero vector to the zero vector, so our inequality holds when zr 
is 0 as well. Thus (i) and (ii) are equivalent. 
Let us now show that (i) implies (iii). If {zn} is any sequence in E, 


that converges to a vector ZoeE\, for || - li, then {Zo _ Zn} converges to 


lA 


the zero vector of E, for || - |j,;. But by (i) we must have ||T (Zo — Zn) llo 
M| (Zo 2 Zn) l|1, and so it follows from the linearity of T that |" T (zn) 
T (Zo) for all || ll2. 


Finally, let us show that (iii) implies (ii). We shall assume (iii) and the 
negation of (ii) and deduce a contradiction: We are supposing that, given 
any M, we can find reE, such that ||z||, < 1 and ||T (z) l2 > M. Choose 


Z1€E; so that zal} < 1 and ||T (z1) lle > 12. Next choose Z2€E) so that 


— 


\Izoll, < 1 and ||T (z2) ll2 > 22. Continue in this way. After 21, 22,...-In 
have been chosen so that WIz5 ll, < 1 and ||T (z;) lo > j? for j = 1,2,...,n, 
choose Zn+1€E) so that ||z,+1/|) < 1 and ||T (Zns1) lo > (n +1). 
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Now the sequence { Za} C E, satisfies the inequalities ||—*||1 < 1 and 


(|T (2:) ll2 > n for all n. But then the sequence {2} C E, converges to 


—_ 


the zero vector of E, for || - ||1, whereas the sequence {7 (3) C Ez does 


not converge. This contradicts (iii) and completes the proof. 

A linear map from a vector space to itself is called a linear operator on that 
space. We have two natural linear operators on the space €? (N). They are the 
right-shift operator 

A, (2) = A, ({zn}-_,) = {0, 21, 22,...} for alll zl? (N) 
and the left-shift operator 
Ac (2) = Ae ({2n}%1) = {zn}%2p for all Zet (N) 


Clearly, ||A, (z) lle = lz Ilo for every z, and \|Ae (z) lo < || z |l2 for every 


z. Thus these linear operators are continuous on & (N). 
Because A, and A; map @? (N) to itself, we can compose them (Chapter 
Zero, Exercises 2, problem 2). We see that A;o A, is the identity operator (i.e., 


the map / such that /z = z for all z) whereas A, o A, is not. 


The space ¢! (Z) consists of all sequences z = {zn}°, such that 


oo 
> | Zn |< 00 
-—- oo 
This last number is zh, Clearly, 
den 
—0o 
is convergent and so we may define a map y: ¢! (z) — C by setting 
. oo 
e(z) = o({zn}) = >— zn 
—0o 


for every zee! (Z). This mapping is easily seen to be linear. A linear map from 
a vector space into its field of scalars is called a linear functional on the space. 
Because K has a norm (the absolute value function [see Chapter Zero, Exercises 
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3, problem 3 and Section 4], is a norm on K), we may consider the continuous 
linear functionals on a normed space. Note that y is continuous because 


1 (2) Ile (f{2n}) 1 So en IS DOL an = MZ Ih 


for every zeé!(Z). 


Now let us consider a fixed nonempty subset D of K and a function f : 
D — K. We shall say that f is a bounded function or that f is bounded on 
D, if there is a real number MM such that | f(t) |< M for all teD. Such a 
function need not have a maximum in D but the least upper bound of the set 
{| f (t) || teD}. what we call the supremium of | f |, is a real number (Chapter 
Zero, Section 3, Definition 1 and Remark (8)). 

Now let B(D) be the set of all functions from D into K that are bounded 
on D. This is a vector space over K (Appendix A) and we can define a norm on 
this space by setting ||f||,. = sup | f |= lub{| f (t) || teD} (we call this the sup 
norm) for each feB(D). Furthermore, we can easily characterize the sequences 
of functions that converge for this norm. We have: 


Lemma 2. A sequence {f,} € B(D) converges to the function foeB(D) 
for the norm || - ||. if, and only if, this sequence converges to fo uniformly over 
D. 


Proof. Let {f,} C B(D) and suppose that this sequence converges to foeB (D) 
for || - |loo. Then, given € > 0, there is an integer N such that || fo — fnlloo < € 
whenever n > N. But this says | fo (t) — fn (t) |< lub{| fo (t) — fn (t) || teD} < 
€, for every teD, when n > N. Hence, { f,} converges to fo uniformly over D. 

Now suppose that the sequence {f,} converges to fo uniformly over D. 
Then given € > 0 there is an N such that | fo(t) — fn(t) |< € for all teD 
whenever n > N. Clearly then, || fo — falleo = lub {| fo (t) — fn (t) || teD} < € 


whenever n > N showing {f,} converges to fy for || - |loo- 


In Chapter One, Section 4, Lemma 1, we showed that a sequence {f,} of 
functions defined on D is uniformly convergent over D if given € > 0 there is 
an N such that | fn (t) — fm (t) |< e€ for all teD whenever both m and n exceed 
N. It follows that a sequence {f,} € B(D) converges for || - ||. if, and only 
if, given € > 0 there is an N such that || fn — fmlloo < € whenever both m and 
n exceed N. Normed spaces that have this property are extremely important 
in both pure and applied mathematics. Before we can discuss them we must 
define our terms. 


Definition 3. Let (£, || - ||) be a normed space. A sequence {zn} of vectors 
in £ is said to be a Cauchy sequence if, given ¢ > 0, there is an N such that 
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Zn —- Z ml < € whenever m and n both exceed N. A normed space (E,, || - ||) 
is said to be a complete normed space, or a Banach space, if every Cauchy 
sequence of points of E converges to a point of E. 


We have just noted that the space (B(D),|| - |loo) is a Banach space. In 
particular, (B(T),||- ||..), T the unit circle, is a Banach space. Observe that 
the continuous functions on T are bounded and hence C'(T) € B(T). Further- 
more, any sequence in C'(T) that is a Cauchy sequence must have its limit in 
C(T). Because it is Cauchy it converges to an element of B(T) and because 
each f,, is continuous, its limit must actually be in C (T) (Chapter One, Section 
4, Lemma 2). Thus (C'(T), || - ||.) is another example of a complete normed 


space. 


Exercises 2 


«1. Let (E, || - ||) be a normed space and let {zn} C E converge to ZoeE for 


(a) Show that the sequence of real numbers {lz all} converges to the 
number || zo]|. 


(b) Show that {zn} is a Cauchy sequence (Definition 3). 


*2. Let S be a nonempty subset of a normed space (E, || - ||). A point ro of E 
is said to be an adherent point of S if every neighborhood of Zo contains 
infinitely many points of S. The set of all adherent points of S is denoted 
by S’, and the set SUS’, which we shall call the closure of S, is denoted 
by S. 


(a) Show that ZoeS’ if, and only if, there is a sequence of distinct poiuts 
of S that converges to ro. 


(b) Show that (S’)’ C S’and hence S’ is a closed set. 


(c) Show that S is a closed set and show that any closed set that contains 
S must also contain S. 


(d) If H is a linear subspace of E, show that H is also a linear subspace 
of E. A linear subspace of E that is also a closed set is called a 
closed linear subspace of E. 


(e) Show that C (T) is a closed linear subspace of (B(T) , || - ||). 


3. For each neN let fon (t) = t?" for —a < t < mw and let fon (t +27) = 
fon (t) for allt. Then {fon}?”., © C(T). Show that any finite subset of 


{ fon} is linearly independent. 
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4. Let E,, Ez be two normed spaces and let T : E; — Ez be a continuous 
linear map. 


(a) Show that the null space (Appendix A) of T is a closed linear sub- 
space of EF}. 


(b) More generally, show that for any closed subset C’ of Ez the set 
“(cy= { EE, | T (z) eC} is a closed subset of EF}. 


5. We can define two linear functionals on C'(T) as follows: 


(a) 1(f) =f", f (t) dt for each feC(T): 


(b) For any fixed s € T, let y,(f) = f (s) for each feC(T). Show that 
these maps are continuous when C’'(T) has the sup norm. 


6. Recall the space f'(Z) (Section 1, example (iv)). For each aeé'(Z) let 
F (a) = >. a(n)e'”® 


-% 


(a) Show that ¥ (a)«C'(T) and show that the map taking each a to 
F (a), call it F, is linear. 
(b) Show that F is a continuous linear mapping from (¢' (Z), || - ||1) into 


(C(T) «II + Ilse). 


3. Inner Product Spaces 


Returning once again to elementary physics and our informal discussion, we 
recall that the dot product of two vectors u,v in R?, or R?, is the product of 


their lengths and the cosine of the angle between these vectors. Thus (u ,v v) = = 


|| {||| v || cos @. Because we have an inner product on R® (introduction to this 
chapter, equation (1)) and we have a norm on R"(Section 1, example (i)), we 
u.v 
union 
out we'd have to show that the quantity in curly brackets is between —1 and 
+1; otherwise the arccosine wouldn't be real. This is, as we shall see, actually 
the case. So we can give “direction” to the vectors in R”. But this is not, as 
far as I know, very useful. What is important about an inner product is that 
it enables us to define “orthogonality” of vectors. In R?, or R?, two non-zero 
vectors are orthogonal if the angle between them is 3. In this case the cosine is 
zero, sO tle inner product of the two vectors vanishes. In the general case when 
we have an inner product defined on the vectors of a given space, we define 
two vectors to be orthogonal if their inner product is zero. This is an extremely 
useful notion as we shall see below. Let us now give formal definitions of these 
concepts. 


might define the angle between u, veR” to be cos~! { \ To carry this 
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Definition 1. Let V be a vector space over K. The set of all ordered pairs 
(u, v) whose members are in V is denoted by V x V (Chapter Zero, Section 


1, definition 5). A function y from V x V into K is said to be an inner product 
on V if 


(a) For any u), ua, v in V and any scalars a, 3 we have y (au, + Buo, v) = 
ay (a, v) + By (we, v); 


(b) For any u,vinV, Y~ u, v) = (v, u), where the bar denotes complex 
conjugation (Chapter Zero, Exercises 4, problem 1); 


(c) For all veV. yp (v. v) > 0 with equality holding when, and only when, v 
is the zero vector. 


When y is an inner prodict on V, it is customary to write (a, v) in place of 


Yp (zu, v) and to call the pair (V (,)) an inner product space. 


For the remainder of this section we shall suppose that any vectors we 
consider are members of a fixed inner product space (V. (, )). 


It is convenient to define for each veV, Teal to be (Ge *))' and to call 


this the norm of v. It is clear that |lav vl =| a al (| » || for any veV and any 
aeK, and it is clear that || v|| = 0 if, and only if, v is the zero vector. To prove 
that || - || is a norm, we must also show that it satisfies the triangle inequality 
(Defnition 1(a) of Section 1); i.e., we must show that ||u + v|| < ||ull + [lvl 
for all u, v in V. This will follow from Theorem 2 below (Corollary 1). 


A vector veV is called a unit vector when |v || = 1. Such a vector is 


sometimes said to be nome We can normalize any nonzero vector v by 


multiplying it by the scalar Ta A set of vectors is said to be normalized if 


each vector in the set is a unit vector. 


Definition 2. Two vectors u. v in V are said to be orthogonal if (a, v) = (); 
A nonempty set of vectors is said to be an orthogonal set if any two distinct vec- 
tors in this set are orthogonal. Finally, an orthogonal set that is also normalized 
is called an orthonormal set. 


It is easy to check that the functions defined by equations (1) and (2) in 
the introduction to this chapter are inner products on R” and C", respectively. 
If, for each j = 1,2,...,n, we let € ; be the vector (0,...,1,0,...,0), where the 1 


is the jth place, then cc i} is an orthonormal subset of R” and also of C”. 
jJ= 
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—. ym 
Theorem 1. (Bessel’s inequality, weak form). Let {v i} ~ be a finite ortho- 
j= 


m 
normal set in V. Then for any vector veV we have a | (v, v;) 2< |v |I?. 
j=1 


Proof. The proof is by direct calculation using the properties listed in Definitions 
1 and 2. It is convenient to set cj; = (v, 05) for j = 1,2,...,m, and to use 
different indices in our sums. We have 


m m m 
0< 8 —Sretjl? = (3 Sais -Srais| i 

j=l j=l k=1 

m m 
II» |]? Lani v)— ee (v, ve) + p» cj (05, 0k) = 
k=1 k=1 
II? - Sole - Dla P+ le 
j=) 


k=1 


because (v5.0) = 0 unless j = k. Thus we have 


wm 
O<fvl?-> le / 


j=) 


and we are done. 


Theorem 2. (Cauchy, Schwarz, Bunyakowski). For any two vectors u, v in 
V we have | (u,v) l< Julio]. 


Proof. The result is trivial when v is the zero vector (see Exercises 3, problem 
la). When this is not the case, iw 
2) ?s ull?, giving us | (uw, 0) P< ally. 
Taking square roots we have our result. 


\ is a finite orthonormal set. Thus, by 


Theorein 1 we have | (u, 


Remark 1. In our discussion of R® we suggested that one might be able to 
define the angle between two nonzero vectors u,v by using cos~! | \ 
Theorem 2 shows that when the inner product is real, the quantity in curly 
brackets is always between —1 and 1. Thus, in this case, the arccosine is always 
a real number. 


Corollary 1. For any two vectors u,v in V we have ||u + vl < |/ull + |v]. 
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Proof. First note that (u, v) +(%, u) is real and less than 2 | (u, v) |. Hence 
we may write 


ju + vl? = (u+vutv)=(au+0)+(v,u+0) = (au) + 
(u,v) +(v,u)] + (0,0) < ful? +2] (u,v) | +101? 


—> —> —> —> —)> —> 2 
But by Theorem 2 this last quantity is < || u ||?+2|| u|||| v |]+|| v ||? = (\ ull + |lv 1) 
Taking square roots we have our result. 


Remark 2. We have now shown that when (V (,)) is an inner product space 


—_ ox 


the function v > (Cv, 7)’ is a norm on V, i.e., setting || v || = ((3,7)) 


for every veV, (V, | - ||) is a normed space. We shall call this the norm de- 
fined by, or associated with, the inner product. Unless the contrary is explicitly 
stated, whenever we speak of a norm on an inner product space we shall mean 
the norm defined by the inner product. 


We have now shown, in particular, that the functions defined on R” and 
C” in examples (i) and (iii) of Section 1 are norms on these spaces. They are 
the norms defined by the inner products mentioned in the introduction to this 
chapter (equations (1) and (2)). 


Remark 3. An inner product space (V, (,)) always has, as we have just seen, 
a norm. Hence we can discuss open sets, closed sets, and convergent sequences 
in the space as well as continuous mappings on the space. 


Let us now recall the set €2(N) (Section 1, example (ii)). This consists 
oo 
of all sequences {z,}>-_, of real numbers such that > xz? is convergent: i.e., 
eee 
it is the set of all functions z : N — R such that > zr (n)? is convergent. We 
=1 
know how to add two such functions and how to multiply any sich function by a 
scalar, but are the results still in 42 (N)? Clearly, \ {an}P~, = {Azn}o, €€2 (N) 
whenever {z,},_, is in this set and AeR. Let us show that for any {z,}°~ 
and any. {yn} p_, in & (N) their sum is again in this set; i.e., let us show nes 


when Siz? < oo and 2 < oo then > (In + Yn)” is finite. 


n=1 n=) n=1 


For each fixed N we have {r,}*_, and {yn}*_, in the space R’. This is, as 
we have seen, an inner product space and so we may use the C.S.B. inequality 
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(Theorem 2) to obtain 


| cee {yn} n=1 1) |=| ache BS (338 ) (>. ‘] 


n=1 


-_ 


Letting N — oo we see that whenever {7,} and {y,} are in &(N) the 
oo 


series ) Inn iS convergent. Furthermore, because 


n=1 


N 


Yo (en +m) = +2) nin + Doak 


n=) n=] 


we see, by again letting NV — 00, that {rn + yn}?_, is in @ (N). 
It is easy to check that & (N) is a vector space over R and if we define for 
oo 


each z = {tn}*_,,y = {yn}, im this space < z,y > to be ET we 


n=l 


easily see that (€2 (N),(,)) is an inner product space. Thus || - ||2, where for 
bf 


eo) 3 
each x = {z,}™,, lz lle = (3: 2) , is a norm. 


n=1 


Exercises 3 


In problems 1-3 we assume that all vectors considered are in a fixed inner 
product space (V, (, )). 


*1. (a) Show that (u, 0) =Q= (0, u) for any u. 
(b) Prove the parallelogram identity: For any u,v 
ue + OI? + lle — oI? = 2 (al? + eI) 
(c) Prove the Pythagorean relation: if < u, v >= 0, then ||u + vill? = 
| a ||? + [|v |l?. 
(d) Prove the polar identity: For any u,v 


l — — — — ‘ —_—> —_ . —> — — 
r {il + v|[? -—lfu — v ||? + il) u +iv||? — illu - iv||?} = (u,v 


2. Show that any finite subset of an orthonormal set in V is linearly inde- 
pendent. 
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x3. Let {un} and {vn} be two sequences in V that converge, for the norm 


of V, to Uo and Vo respectively. 
(a) Show that for any wev, in es cr w) = (wo, w). 
(b) Show that | is EC Atins Us \ = Uo, Yo). 


4. On the space C'(T) define, for any f,g,< f,g > to be fr. f (t) g (t)dt. 


(a) Show that (,) is an inner product on C (T). 
(b) Denote the norm associated with this inner product by || - ||2 so 


r 4 . | 
lle = ((4,.))4 = (J%, L(t) at)”. Compute jle™*||2 and [le'™* I} oo 
(Section 2, just after Definition 3) for any fixed neZ. 


(c) Show that { e~ | neZ \ { cosnt nt\ and { singt nt \" are orthonormal 
V2n ; ™ Jnz=1 * Jn=1 
sets in (C(T),<,>). 


5. Find an infinite orthonormal set in 2 (N). 
6. Recall the set 2(N) (Section 1, example (iii). a consists of all 


sequences of complex numbers, z = {z,}, such that 3 | zn |?< 00. 


n=] 


oo 

(a) Show that for any {zn}, {wn} in €2(N), > Zn Wr, is convergent. 
n=) 

(b) For any two nonzero complex numbers z, w let a = ES. Note that 


| a |= 1 and azw =| zw |. Hence if {z,} and {w,} are in 2 (N), 
oo 


so is {a@n2,} where a, = Eo Thus we see that ) | ZnWp | is 
n=] 
convergent. 


(c) For any z = {z,},w = {wn} in @ we have defined z + w to be 
{zn + wy}, and for any AeC we have defined Xz to be {Az,} . Show 
that z + w and Az are in €2.(N) and that ¢2.(N) is a vector space 
over C. 


(d) For any z = {zn}, w = {wp} in €2. (N) define (z, w) to be >: SW ns 
n=1 


Show that (,) is an inner product on ¢2 (N) and that the norm associ- 
ated with this inner product is the one defined in Section 1, example 


(iii), 
(e) Find an infinite orthonormal subset of (@2 (N), (,)). 
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4. Infinite Orthonormal Sets, Hilbert Space 


Let us begin our discussion with a fixed inner product space (V,(,)) and an 
orthornormal sequence (i.e., a sequence that is an orthonormal set) {v;} CV. 
We have seen several examples of such sequences (Exercises 3, problems 4c, 5, 
and 6e). Suppose that for some ueV and some scalars {a,;} we have 


oc 
mm = So aye. (1) 
j=1 


— lim 
What we mean, of course, is that u = > a;u; where convergence 
rn- 


=1 
is for the norm on V defined by its inner saame Then, by problem 3a of 
Exercises 3, we may write 


li k ees 
lim : + 
a mle He] (2) 


j=! 


k 

lim - 

= ) a; (¥j,Un) =, for each fixed n. 
ik — 00 = n)= 


Thus when (1) is true the scalars are uniquely determined by U. They are 


aj = (4.05), 9 =1,2,-- (3) 


We can immediately apply our result to the space C(T). Recall that, for 

all f,g in this space, (f,g) = ff @ g(t) dt. Furthermore, the sequence 
int 

= | neZ } is orthonormal (Exercises 3, problem 4c). Now suppose that, 

7 


2 
for some feC' (T). 


ent 


f= Yoon 


(4) 
Then, as we saw above 


fie (s —) e =| f(t)e iat = Vane, (f) for every n. (5) 
n ax Jan ee n . 
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Putting this into (4) we find that 


fo (= -[ f(t) mat) - Som (f) ein" (6) 


where, we recall, that c, (f) = rr Jf (the7dt (Chapter One, Section 5, 
formula 6) 
Thus (4) is the complex Fourier series for f. 


Returning now to (V,(,)) and the orthonormal sequence { v,} we can 
generalize Theorem 1 of Section 3. 


Theorem 1. (Bessel’s inequality, strong form). For any vector ueV we have 
oo 


D1 (a, 9;) Ps ya. 


Proof. Let us set a; = (a, v;) for each j = 1,2,--- . Then 
= 


m 
| ay |?<| ay |? + | ag [P< -- Ds | a; |?<.... But, by the theorem we are 


generalizing (Section 3, Theorem 1) each term of this increasing sequence of 


oo 

real numbers is bounded by ||u||?. Thus > | (u, v5) |? converges and this 
j=l 

sum is bounded by ||u|| as well. 


O© 


Corollary 1. For any feC'(T) we have 27 (>: | cn (Ff) ') < ||f\|2 and 


—0O 


fn (3: {lan (f) [2 +1 On (4) °}) < |I7I12- 
n=) 


Thus far we have discussed what we can say when we know that the series 


(a , Uv; ) v, converges to u. There is a kind of inner product space for which 


every series formed in this way will converge; i.e., for every vector u in this 


OO 
kind of space the series ) (u, v , v, converges. 
j=l 


Definition 1. Given an inner product space (V,(,)) we recall that for each 


veV, lvl] = ((v, v) ) . We shall say that (V, (,)) is a Hilbert space if (V, || - ||) 
is a complete normed space (Section 2, Definition 3). 
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We may now state the following: 


Corollary 2. If (V,(,)) is a Hilbert space, then for every vector ueV the 
oo 


series y (a, vu; ) vj converges to a point in V. 
j=) 


Proof. Given ueV set a; = (u, v5) for each j = 1, 2,.... Then for any natural 
numbers m,n with n < m we may write 


Iai, Dail? = 3 a;v;|l 


j=n+il 
m 
2 
=( 3 an a¥s)= Stash oH 
jan+l j=n+l j=ontl 
oo 
because {e i his an orthonormal set. But by Theorem 1, the series a | a; |? is 
j=l 
m 
convergent. Hence, given € > 0 we can choose N so that > | a; |?< € when 
gent) 
So 
m 
m>n> N. It follows from (7) that >= a; vj is a Cauchy sequence in 
j= n=) 


(V, || - ||). Because this is a Hilbert space, every such sequence converges to a 
point of V. 


Unfortunately, the series 3 (u,» Vj v j» even when it does converge, does 
Pas 
not necessarily converge to u. To see why this is so, suppose that the series 
does converge to u and suppose, further, that (u, U 1) # 0. Then {a} ws is 
gz 


an orthonormal set, and the series 3 (u, vj) vz does converge, but not to ue 
j=2 
To ensure that the series converges to the vector that generates it, we must 
somehow guarantee that we have not left any vectors out of our orthonormal 
set. We can do this by requiring that our set has the following property. 


Definition 2. Let (V,(,}) be an inner product space. An orthonormal subset 
S of V is said to be a maximal orthonormal set if the conditions (i) S’ an 
orthonormal set and (ii) S € S’, imply that S = S’. 


We leave it to the reader to show { that an orthonormal! set S is maximal if 


—, 


u, v) = 0 for all veS implies u = 0 (Exercises 4, problem 1). 
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The discussion just before Definition 2 has led some to call a maximal or- 
thonormal set a “complete orthonormal set.” We shall not use this terminology. 


Hilbert spaces first arose in the papers of David Hilbert on the subject of 
integral equations, and it is well known that he was attracted to this area by 
the pioneering work of Ivar Fredholm. These spaces play an important role in 
establishing the foundations of modern quantum mechanics. Their importance 
for the study of trigonometric series stems from the following result. 


Theorem 2. Let (H,(,)) be a Hilbert space and let {a5} be an orthio- 


J= 
normal sequence of vectors in H. If this sequence has any one of the four 
following properties, it has all four of them: 


_. 1 
(a) { v i} s is a maximal orthonormal set; 
J= 


oo 
(b) For each ueH the series > (u, v;) vj converges to u; 
j=1 


oS 
(c) (Parseval’s relation) For each ueH we have I) u ||? = 3 | (a, v5) IF 


j=) 


(d) (also called Parseval’s relation) For any two vectors u,v in H we have 


oo a 
j=! 


Proof. Assume that {v;} is maximal and let u be any vector in H. We have 


oo 
seen, in Corollary 2 above, that > (u, v5) v; converges to some element of 
j=l 
HT. We must show that its limit is actually the given vector u. For any fixed n 
we have 
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and by Exercises 3, problem 3a this is 


lim: fs. aya ek SS 
Se COC) ae 
= (Hs Bn) ~ pyr cg 2a (Fs) (Bsn) = (Hs Ba) ~ (H.Fu) =O 
Thus u = ¥ (a, ¥;) v5 must be the zero vector because {oj} is a 
maximal orthonormal set (Exercises 4, problem 1). So we have shown that (a) 


implies (b). Now suppose our set has property (b). Then for any ueH we may 
write 


oS e m 
2 a ae at! Salk ak take lim = = ee. Hees 
Vay? = (zz) = (5 (3 =e i) : Be wo (Bs i 


j=1 


and again by Exercises 3, problem 3a, this becomes 


| 
= -— 
! 5 
8 
iM 
on™ 
sl 
St 
el 
—~sL_’” 
il 
3 ous 
! 5 
8 
Ms 
o™ 
el 
= 
~~” 
Pian’ 
ch 
el 


° m a 
lim Se, S35 aan 
= ) Uy, Uv; U,U; 


— 
= —_ —> 
| (u, v5) f=) (u, v5) |? 
= j=l 


w 


Thus (b) implies (c). But for any vector ueH a direct calculation shows 
that 


m 


m 

= —_ ao — —. —_ A> 

We — > (4,05) VIP = Wal? — 0 | (2,25) P 
j=l 


j=l 


It follows from this that (c) implies (b) and hence these two conditions are 
equivalent. 
Suppose now that {e 5} has property (c) and let u,v be any two vectors 
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in H. To prove that (c) implies (d), it suffices to show that (b) implies (d). Now 


foo) 
(u,v) = u,> (v, 9; Vv; 


j=l 
= cm > (v, 05) *,) 
- SER (es) LG) Ee) 
j=l j=l 
And this is (d). 


Finally, we show that (d) implies (a). We shall show that assuming our set 
has property (d), the only vector in H that is orthogonal to every v; is the zero 
vector. So suppose veH and (v, ¥;) = 0 for every j. Then by (d), we have 


oo —— 
{|v ||? = (v,v) = 30 (7,95) (9,05) 
j=l 
By our own assumption each term of the series is zero showing that || v || = 0 


and hence that v is the zero vector. 


Definition 3. Let (H, (,)) be a Hilbert space. An orthonormal sequence that 
has any one, and hence all four, of the properties listed in Theorem 2 is called 
an orthonormal basis for H. 


Let us show that the inner product space (é% (N),(,)) is a Hilbert space. 


_. 10 
Given a Cauchy sequence {zn} in this space we first note that for each 
n= 
fixed n 


In = {2n1,Zn2; In3,-.-} (1) 
Now given € > 0 we can find an integer N such that 
zn — Tmll2 <€ (2) 


when m,n > N. Using (1) we may rewrite (2) as follows: 


> (Inj — Lmj)” < e? (3) 
j=1 
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for m,n > N. For any fixed j the sequence {z,;}>_, is a Cauchy sequence of 
real numbers because 


oo 
2 2 
| nj — By [?= (taj — Fmj)” SD, (nj — Zmz) < &? 
j=l 
when m,n > N. Let us set xoj = ,"™ 2nj for each j = 1,2,.... We shall first 
show that {x0;}'° | e€% (N). Choose and fix n > N and note that for any k we 
have {xo, Va and {rn; eae in the space R*. Because || - ||2 is a norm on this 


space we may use the triangle inequality to write 


k 2 k 3 ‘. 3 
(5: 4) = (5 (ro; - “i + (5 2) (4) 
j=1 j= j=) 


Letting k — oc on the right-hand side of (4) we have, by (3), 


. \3 . 
>_ 35; $€+|[Zall2 
j=) 


Letting k — 00 we see that{ Zo} «(2 (N). 


Oo 
Finally. letting m — oo in equation (3) we obtain > (tn; — o;)° < 
j=) 
when n > N, and this says that ||zo9 — rajle < ¢ whenn > N. 


Exercises 4 


*1. (a) Let (V,(,)) be an inner product space and let S be an orthonormal 
subset of V. Show that S is a maximal orthonormal set if, and only 
if, the only vector in V that is orthogonal to every element of S' is 
the zero vector. 


(b) For each neN let en = (0,...,0,1,0,...) where the one is in the nth 
—_ oo ah 
place. Clearly, {en} _, © &(N) and also {en} C ¢2(N) (Section 
1, examples (ii) and (iii)). Show that this set is a maximal orthonor- 
mal sequence in £2,(N) and also in ¢2 (N). 
Recall the inner product on C'(T) (Exercises 3, problem 4). Show 
tnt 


: 5 | neZ > is maximal. Hint: See 
™ 


Tie 


ww 


(c 


that the orthonormal sequence 


Chapter One, Section 7. 


5. The Completion 103 


2. Let (Hi, (,),) and (H2,(,)2) be two Hilbert spaces over the same field. 
pan oo — oo 

Suppose that {5}. ; C HA, and {w;} ; C H2 are maximal ortho- 
j= j= 

normal sequences (let us stress that both of these are infinite sequences). 


oo 
For each 7 let T (v;) = Ww; and, given u= S60; 0; in H,. define T 3, 
j= 


to be S752 | aj wy. 


(a) Show that T is well defined. 
(b) Show that T is a linear one-to-one mapping from H, onto Hy. 


(c) For any vectors u, v in H, show that (a, v) = (T (x) a (7) F 


in particular, |||, = || (2) ll2 for every weFh. 


5. The Completion 


The Lebesgue integral had a profound effect on the study of trigonometric 
series. It led, among many other things. to the creation of the Lebesgue classes; 
also known as the L? spaces (read “L.P.”). Here 1 < p < ox. and for each 
such p, DL? is a Banach space whose elements are equivalence classes (Chapter 
Zero, Exercises 2, problem 1) of functions defined on, let us say, |—7.7]. For 
feL? [—7,7] the L?-norm of f is defined as follows: 


1 


isle= (fis Pat)’ 


In this context, equal functions have the same integral, so this is well de- 
fined. It is customary to “abuse language” and to refer to the ele:nents of L? 
as L? -functions. The connection to trigonometric series is this: 


Theorem 1. For each fixed p, with 1 < p < oo, and each feL? [—7. 7] the 
Cesaéro means of the Fourier series of f converge to this function for the L?- 
norm. 


When p = 2, and only then, || - |l2 is the norm associated with an inner 
product. Thus L? [—7, z] is a Hilbert space and here we have a stronger result: 


Theorem 2. The Fourier series of any function in L? |—7,7} converges to 
that function for the L?-norm. 


We shall not prove Theorem 1 (see the Notes for a reference). Theorem 2 
int )& 


20 
L? |—x,7] (Section 4, Theorem 2) . We shall do this very soon. 


follows immediately once we show that { } is an orthonormal basis for 
— OO 
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Since we are not assuming a knowledge of the Lebesgue integral, so we 
must define the L?-spaces in a different way. 


Definition 1. Let (£, || - ||) be a normed space and let D be a subset of E. 
We shall say that D is a dense subset of E or that D is dense in E if for any 


zeE we can find a sequence {¥n} C D such that Rise Yn = Z. If (V, (,)) is 
an inner product space and D C V, we shall say that D is dense in V if it is 


dense in (V, || - ||) where || - || is the norm associated with (,). 


We recall that the set of trigonometric polynomials (Chapter One, Section 
7, just before Corollary 2) is dense in (C (T) , || - ||.o). Theorem 1, quoted above, 
shows that C(T) is dense in (L?(—7,7],||- ||p) for each p. We shall use this 
fact to define the L?-spaces. 


Theorem 3. For any normed space (£, || - ||) there is a Banach space (B,, || - ||1) 
and a linear map T : E — By, such that: 


(a) For every zeE, |\z|| = ||T (z) lla; 


(b) The set (it is actually a linear subspace) T(E) = {T (z) | reE \ is dense 
in (By, || - lla). 
Furthermore, if (Ba, || - |j2) is another Banach space that has properties (a) 
and (b), then there is a linear map T’ : B, — Be that is one-to-one, onto, and 
norm preserving (i.e., ||T”’ (z) lo = |/zl], for every reB,). 


A proof of this theorem can be found in Appendix B. We call (B,, || - |l1) 
the completion of (E£, || - ||) and often say, identifying £ with T(E), that E is 
dense in its completion. 

On the space C'(T) we can define a function || - ||, by setting 


Ih = fo | f(t) | at 


for every feC' (T). It is easy to see that this gives us a norm on C'(T). The 
completion of (C (T) . || - |]1) is defined to be the Banach space L! [—7, 7]. When 
p > 1, and p F 2, it is not so easy to show that || - ||,, defined to be 


(fire Pat)” 


for each feC (T), is a norm. It is, however, and the completion of (C (T, ) || - |lp) 
is defined to be L? [—7,7]. Because we do not need these spaces, we do not 
carry this discussion any further. 
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For the special case p = 2 we have a stronger form of Theorem 3: 


Theorem 4. Given an inner product space (V (,)) there is a Hilbert space 
(H,{),) and a linear map T : V — H, such that 


(a) For all u,v in V, (Tu,Tv) = (u,v); 


(b) The set (it is actually a linear subspace) T (V) = {T (v) | vev} is dense 
in (fh, (,)) ; 


Furthermore, if (H2,(,).) is another Hilbert space that has properties (a) 
and (b), then there is a linear map T’ : H, — Hz that is one-to-one, onto, and 


for which (T'u,T'v), = (a, v) for all u, » in Hy. 


A proof of this theorem can be found in Appendix B. We call (#1, (;),) 
the completion of (V,(,)) and, as in the case of a normed space, we say V is 
dense in its completion. 


We have defined for any f,geC (T), (f,9) = J”, f (t)g(t)dt, and we have 
seen that this is an inner product C(T). The completion of (C(T),(,)) is 
denoted by L? (T) (read “L two of T”). 

The inner product and norm of L?(T), when restricted to elements of 
C'(T), is just the inner product and norm as defined above. Thus the sequence 


int \»% 
=} is an orthonormal sequence in L? (T). We shall show that it is an 
T) ~o0 


orthonormal basis for this Hilbert space. First we need: 


Lemma 1. An orthonormal sequence {v;} in a Hilbert space (H, (,)) is an 


orthonormal! basis for H if, and only if, the linear span of {9 i} (Appendix A) 
is dense in H. 


Proof. Recall that the linear span of { 03}, denoted by lin {v5}, is the set of 
all (finite) linear combinations of elements of {2 j }. 


Suppose first that {v;} is an orthonormal basis for H, and let veH be 
given. Then by part (b) of Theorem 2 (Section 4) we have 
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which is clearly contained in tin{ i}, converges to v showing that the linear 
span is dense in 7. 

Now suppose that lin {25} is dense in H. To show that {25} is an Or- 
thonormal basis for H, we need only show that it is maximal (Section 4, The- 
orem 2). 


So suppose veH and (v,0 Vv i) = 0 for every 7. We must show that v is 
the zero vector. Let ¢ > 0 be given. By our assumption there is a we lin { v v5} 


such that ||v — wi < €. Thus, by Theorem 2 of Section 3, we have 
Jol? =| (, 0) j=l (v -w, 0) |< |v - will el] < ell 


because we lin {v5} and (v, 05) = 0 for all dso (v,w) = 0. Now if v is not 


the zero vector, then || v|| < ¢ for any € > 0, which is clearly a contradiction. 


cn oo 
Theorem 5. The set {x} is an orthonormal basis for the Hilbert space 
—cO 
L? (T). 
Proof. According to Lemma 1] we need only show that the linear span of the 


given sequence is dense in L? (T). Let feL? (T) and € > 0 be given and use the 
fact that C'(T) is dense in this space to find geC' (T) such that || f — gll2 < §. 


Now the linear span of {sm} is dense in (C'(T), |] - lo.) (Chapter One, 
—0o 
Corollary 2, Section 7). Thus there is an element 7 of this linear span such that 
® 2 
llg—Tllac < Se. But then |lg—rlI3 = f%, 19 (t)—7 (t) |? dt < ghey (22) = $ 
Combining our inequalities we have ||f — 7Il2 < ||f — gllo + llg —7llo < 
5 + § =€ and we are done. 


eint ant 
Corollary 1. For any feL? (T) the series (1 a(t =) on converges to f 
7 7 


for the L?-norm. In particular for any fe (C (T)) , the series 


ES) -B(E [ema 


to.) 


converges to f for the L?-norm. For any such f we also have 


aS len(A P= [s(t Pat = IIB 
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Exercises 5 


+1. There is no inner product on L! [-7, 7] . Still, {e'"'}""__ is contained in this 
space. Show that the linear span of this sequence is dense in (L! [—7, 7], | - ll1) - 


int oo 
#2. When feC (T) the numbers a, = (=) satisfy > | an |?< oo. 
m —0o 


Given a sequence of numbers {3,}~., such that > | Bn |?< 00 we 
may ask if there is an feC (T) such that (f, =) = 2, for any n. The 


answer is “no.” Show, however, that there is an element geL? (T) such 


ent 
that ( g, —= ) = @, for all n. 
a (9 =) Bn for all n 


+3. Let C(R) be the set of all complex-valued functions that are defined and 
continuous on R. 


(a) Show that C' (R) is a vector space over C (see Section 1 for a discus- 
sion of the algebraic operations on functions). 


(b) An element feC (R) is said to have compact support if there are real 
numbers a,b such that {zeR | f (x) 4 0} C [a,b]. 
Let Co (IR) be the set of all functions in C'(R) that have compact 
support. Show that Co (R) is a vector space over C. 


(c) For each feCo(R) let ||fll1 = f°. | f(x) | dz. Show that || - ||1 is 
a norm on Co(R). The completion of (Co (R),||- {]1) is a Banach 
space that we shall denote by L' (R) (read “L one of R”). 


4. For each f,g in Co(R) define (f,g) to be | ee f(z)g (xz)dx. Show that 
(,) is an inner product on Co(R). The completion of (Co (R),(,)) is a 
Hilbert space that we shall denote by L? (R). 


6. Wavelets 


One of the major applications of Fourier series is to the analysis of real-world 


data. We have seen that is an orthonormal! basis for L?(0, 27] 


e 

or |neZ } 
(Section 5, Theorem 5) and so any function in this space can be approximated, 
as closely as we wish in the L? norm, by a finite linear combination (what 
physicists call a “superposition” ) of sines and cosines. There are many instances, 
however, when such sums fail to reflect some of the subtleties in the data. 
A better tool, in these instances, are the wavelet bases. This subject can be 
traced to some research in pure mathematics; however, many important results 
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came from the investigations of geophysicists and others interested in signal 
processing. There is now an extensive literature on wavelets and their many 
important applications, and we can only give the reader a very small sample of 
the mathematical ideas underlying the subject. 

A string of measurements, one-dimensional data, or a signal can be thought 


of as a function of time, f(t). The energy in the signal is given by / lf (t)|? dt 
—oco 

and because this is finite, f € L? (IR) (Exercises 5, problem 4). In many cases 
of interest f has compact support (Exercises 5, problem 3b) and many sharp 
spikes or sharp discontinuities. To represent such functions in a series, we seek 
a basis for L? (R), but we want our basis to represent our function at different 
scales (resolutions). In this way both gross or large-scale features and small 
subtle features of our data are captured in the representation. 


Definition 1. A wavelet is a function 7 (t) € L? (R) such that 
{24y (2t—k)|j,ke z} is an orthonormal basis for L? (R). 


Note that we have here both translations (the role of k) and dilations of the 
function wv, the so-called mother wavelet. At this point it is not at all clear that 
any wavelets exist. There are many, however, and, in applications, a judicious 
choice of y yields a powerful new way to analyze data. 

Wavelets can be constructed from a multiresolution analysis. This is, by 
definition, a sequence of closed linear subspaces {V;} of L? (IR) such that 


(i) CViCYCVC..: 

(ii) U{V; | 3 € Z} is dense (Section 5, Definition 1) in L? (R); 
(iii) O{V; | 7 € Z} = {0}; 
(iv) f (t) € V; if, and only if, f (2-2t) € Vo; 


(v) There is a function y € Vo, called the scaling function, such that 
{y(t — k) | k € Z} is an orthonormal basis for Vo. 


In applications, a multiresolution analysis is a mathematical framework 
for decomposing a signal into components of different scales. The spaces V,, 
correspond to these scales. 

If any V,, is given then by (iv), any other V,, can be found because 
Vin = {f (2™-"t) | f € V,}. Hence, a multiresolution analysis is completely 
determined by the scaling function y (sometimes called the “father wavelet” ). 
Because the translates of y give us an orthonormal basis for Vo, the translates 
of y(2"t) gives us an orthonormal basis for each V,,. Unfortunately, however, 
the union of these bases is not, in general, a basis for L? (R). To obtain an 
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orthonormal! basis of wavelets, we construct a new sequence of subspaces as fol- 
lows. For each fixed n € Z let W, = {w € Vana | (w,v) =O forall v € Va: 
W,, is called the orthogonal complement of V,, in V,4;. Every vector in V,41 
can be written, in just one way, as a sum of one vector from V,, and one from 
W,, (proved below). To indicate this, we write V,41 = Vn ® Wa. 

We can repeat this process. V, = V,-,; ® Wn-1 and so Vag: = Va-1 ® 
W,-1 ® W,. Continuing in this way we find that every vector in L? (R) can be 
written as an infinite sum of vectors one from each of the spaces W,,. In symbols 
L?(R) = 7 ® Wn. These spaces are mutually orthogonal and f (t) € W,, if, 


and only if, f (2~"t) € Wo. We see from this last fact that once we find a 
function whose translates give us an orthonormal basis for Wo, we can obtain 
from that function an orthonormal basis for every W,, and because of the 
mutual orthogonality of the W, the union of these bases is an orthonormal 
basis for L? (R). 

The problem of finding a wavelet can be now outlined as follows. Given 
a multiresolution analysis {V,,}, and recall this means we are given a scaling 
function y whose translates constitute an orthonormal basis for Vo, construct 
the sequence {W,,} and find a function ~ whose translates constitute an or- 
thonormal! basis for Wo. The function ~w is a wavelet. It turns out that such a 
wy exists for every multiresolution analysis, and the proof of this fact actually 
gives a method of constructing ~. We have: 


Theorem 1. Let {V,} be a multiresolution analysis with the scaling 
oo 
function y. Then the function y(t) = > (-1)* hy_~ (2t —k), where hy = 


-—- oo 


oo 
2 | y (t) p (2t — k)dt is a mother wavelet; i.e., {27 y(2"t — m) | m,ne Z} 
oo 


is an orthonormal basis for L? (R). 


Perhaps the simplest example of the process discussed above is found by 
using the Haar scaling function H (t), where H(t) = 1 for 0 < t < 1 and 
H (t) = 0 for all other t. It is easy to see that the integer translates of H (t) 
give us an orthonormal set in L? (R). These translates generate the space Vo. 

The function H (t) has its support in (0, 1], whereas the function H (2t — n) 


has its support in F . | These overlap when n = 0 in the set lo. | and 
when n = 1 in the set = . Thus for the Haar function ho = h,; = 1 and 


p (xz) = p(2z) — p (2x — 1). 

The function ~(z) is called the Haar wavelet. As we have already said, 
there are many others and not all of them have compact support. For more 
about this fascinating subject and for a proof of Theorem 1, we refer the reader 
to the references listed in Notes. 
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We end this section with a proof of a property of Hilbert spaces that played 
a key role in the construction above. 


Lemma 1. Let (H.<,>) bea Hilbert space, let M be a closed linear subspace 
of H, and let y € H\M. Then there is a unique vector mp € M such that 
ly — mo|| a inf {||y - m| |m E M}. Furthermore, (y _ fig, m ) = 0 for all 
me M. 


|me M} and choose a sequence {in, \ CM 


n=) 


Proof, Let d = inf {||y _m 


such that d= _|™ ly ~ mn| . Then, by Exercises 3, problem 1b, we may write 


n—oC 
| (¥ - me) + (v --mu) |) + (vm) - (¥- ms) 
— o |2 cee 2 
= 2lly - mel +29 - mil . We may write this as 
| — aa] = 2|]x — ial + 2 |] — mal] — lay - (mu + ime) | 


2 


= mt+tm 
The very last term in this expression can be written as 4 || y — mae 
Bs oi ud mtm,. . 
Now MM is a linear subspace and m;,m,, are in M, so —__* is in M. It 


2 


follows that 4 E - 


my — in| < 2||y - ma +2|/y - mil — 4d?. 


Recalling the way {ttn } was defined we see that our last inequality im- 
plies that {ttn } is a Cauchy sequence (Section 2, Definition 3). Because H 
is a Hilbert space this sequence converges, because {in} C M and M is 
closed, its limit, say Mo, is in M (Section 2, Lemma 1). Thus ly ~ mol = 
lim | - Tn = d. 

Let us now show that mo is unique. Suppose we had two vectors m1,™M2, 
in Af such that ly — mal a= y - imal. Then, as above 


|(v -m) + (yma) + (vm) - (¥ m2) 


ek ee NS sb. ce 
= 2\|y - mal +2||y — mal . This may be written as 
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—_ —_ 2 —_ m m . 
mo - m,| = 4d? -4 syygre dd nL) , but as we discussed in the first part 


ae ey: 
—_, _ —_ 2 
of the proof 4||y — ao > 4d? and so ms - ma|| < 0, giving us 


mo = mM. 
Finally, we must show that (y - mo, m) = 0 for all m € M. Let us set 


Yo = ¥ — mp. Note that for any m € M and any scalar 6,mo + 6m is in M, 
and so d? < ||y — (imo + 6m)| = [lao - sm = (Yo — dm, Yo — 6m) 


= ||vol] - 5 (i, vo) ~ 8 (ort) + 16? [ll 
— ||2 Sy. ch 2 
Because | ¥0| = 9 — mo = d* this becomes 


(x) 0 < -6 (mm, Ho) — 3 (Form) + [6 ml) 
Now suppose (¥o,m) # 0 for some m € M. Then m could not be zero 
(Exercises 3, problem la) and so we may set 6 = — 
m 
Putting this into (*) we obtain 


pe ahem) vorm)[ __[{iuin) 


Se eye ee ee 
el Il | 


Theorem 2. Let (H,(,)) be a Hilbert space and let M be a closed linear 
subspace of H. Then M+ = {ve H |< v,m>=0 for all m} is a closed 


linear subspace of H. Furthermore, (a) MN M+ = {o}, and (b) each v € H 


can be written in one, and only one, way as a sum of a vector in M and a vector 
in M+. 


, which is a contradiction. 


Proof. The proof that M+ is a closed linear subspace we leave to the reader 
(Exercises 6, problem 1). If v € MOM}, then (v, v) = 0, which gives v= 0 
(Section 3, Definition 1c). 

For any v € H\M, there is a unique moM such that 
lv — mo|| = inf {||» - m| | me M} and, as we showed in Lemma 1, 


ah 


v 
(¥ —mo,m) = 0 for all m € M. Thus ¥ — mo is in M+, ie., 0 — mo = P for 
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some vector p € M~. It follows then that v = Mo + P, where Mo eM, p € 
Mt, 
Suppose now that v = = Mo +P ’ where No € AM. p E M +. Then mo +p= 


My + Ps SO (70 - “7o) = (P- p ). Call this vector h. Clearly h eM, 


because fe Mo — My, and h € M+, because h = > — p. Thus, by (a) he = 0 
sal os —_—/ 


and we have mo = Mo9,P =p. 


Exercises 6 
1. Let (H, {,)) be a Hilbert space and let M be a closed linear subspace of 
H. 


(a) Show that M+ = {we H | (w,v) =0 for all V € m} is a closed 
linear subspace of H. 


(b) Define M++ to be (M+)~ and show that M++ = M. 
Notes 


1. A discussion of the foundations of quantum mechanics can be found in 
Introduction to Hilbert Spaces with Applications, by Lokenath Debnath 
and Piotr Mikusinski (Academic Press, 2nd edition, New York, 1999). See 
Chapter 7. 


2. A proof of Theorem 1 (Section 5) can be found in Functional Analysis, 
by Carl L. DeVito (Academic Press, New York, 1978), pp. 92-93. 


3. Wavelets are discussed in the book listed in note 1 above. See Chapter 
8, and for a proof of Theorem 1 (Section 6) see page 441 (their nota- 
tion is slightly different from ours.) Wavelets are also treated in Wavelets 
and Operators, by Yves Meyer (Cambridge University Press, Cambridge, 
1992). in A Mathematical Introduction to Wavelets, by P. Wojtaszczyk 
(Cambridge University Press, Cambridge, 1997), and in Wavelets— 
Algorithms and Applications, by Yves Meyer, translated and revised by 
Robert D. Ryan (Society for Industrial and Applied Mathematics, Philadel- 
phia, 1993). This last reference contains an interesting discussion of the 
functions of Riemann and Weierstrass (see the discussion at the end of 
Section 4, Chapter 1) and their connections to wavelets. See pages 115- 
118. 


Chapter Four 
The Fourier Transform 


In the last three chapters we have been concerned with the Fourier series and 
the problem of recapturing a function from its Fourier series. Here we shall 
change our point of view and investigate the properties of a certain mapping 
between pairs of “function spaces.” By the term “function space” we mean a 
collection of functions that is a vector space over C when addition and scalar 
multiplication are defined in the “usual” way. The mapping, called the Fourier 
transform, is the object of interest here. 

There are a number of natural questions that arise at this point. Why 
change the emphasis in this way? What, if any, is the advantage in doing so? 
At this early stage these questions are hard to answer. Suffice it to say that 
the Fourier transform plays a central role in many areas of engineering, of 
physics, of chemistry, and even, in recent times, of astronomy. We discuss some 
of these applications later. So the effort involved in understanding this mapping 
is well justified. Moreover, the associated mathematics is fascinating and very 
beautiful. 


1. The Fourier Transform on Z 


We recall the space £'(Z) = {a : Z + C | Lla(n)| < 00} defined in Chapter 
Three, Section 1, example iv, and the space C(T) of continuous functions on 
the circle. There is a natural mapping between these spaces: For each a € €!(Z) 
we set 9(a) = °°. a(n)e'"® and note that because )> |a(n)| < 00, the series 
converges uniformly over T to an element of C(T). 

The map & is called the Fourier transform on Z. Its properties are easily 
proved, and so our results here provide us with a model for the discussion of 
this mapping in more complicated settings; i.e., we shall seek analogues of the 
theorems proved here for the Fourier transform on R and on Z,, (defined later). 
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Lemma 1. The map 3%: é!(Z) — C(T) is linear and has an inverse. 


Proof. For any a,b in @'(Z) we have (a + b)(n) = a(n) + b(n) for all n € Z. 
Thus 
3(a + 6) = D(a + b)(n)Je*”? = Tla(n) + b(n)Je'”? 
= Da(n)e'”® + b(n)e'”?| 
= Za(n)e'™ + Dd(n)e*”®? = 3(a) + 9(d) 
It is easy to see that given A € C, 3(Aa) = AG(a) for every a € £'(Z) and soS 
is a linear mapping. 
We have already noted that for any a € ¢'(Z) the series Na(n)e*”® converges 


uniformly over T. If we denote this sum by a(@), then (Chapter One, Section 
5) it follows that 


a(n) = > aG(0)e-*"? de, for every n€ Z 


Thus 3 is one-to-one and, in fact, 3~! exists and is given by 
ips 1 et. 
3-[a(0)] = {5 | aloye-™*a0}e_., 


It is convenient to denote 9$(a) by a(@), or just @, and we shall often use 
this notation below. 


Recall that 2'(Z) and C(T) both have norms (Chapter Three, Section 2, 
Definition 1). For each a € é'(T), |lal], = >> |a(n)|, and for each f € C(T), 
IIflloo = maxger |f()|. 


Lemma 2. The map $ : @'(Z) — C(T) is continuous and, in fact, is norm 
decreasing; i.e., ||al|o. < |lall; for all a. 


Proof. 
en |) = ind 
I[S(@)lloc = [|@lloo = max | >| a(n)e' | 
<> la(n)| le*llao < D7 la(n)I = Hlalh 
The continuity of 3 now follows from Theorem 1 of Chapter Three, Section 2. 


There is a very useful operation on @'(Z) called “convolution.” It is a kind 
of multiplication. 


Definition 1. For any a,b in é'(Z) and any n in Z, we define (a*b)(n) to be 
> mao 4(n—m)b(m). This sequence {(a*b)(n)}>_.,, is called the convolution 
of a and b, or a convolved with 6, and is denoted by a * b. 
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We must show that a «0 is in @'(Z) whenever a and b are both in this 
space. The next lemma proves that and gives us a bound for the norm of the 
convolution. 


Lemma 3. For any a,b in é'(Z) the convolution, a+b, is in @'(Z) and ||a*b||, < 


lalla - [lblfi. 


Proof. Because ||a||, = >>“. |a(n)| we have, for any fixed m, |lal]) = >> |a(n - 
m)|. Now 


la + bil, = > (a *b)(n)| = D0 a(n — m)b(m)| 
SDF Dd la(n — m) | [b(m)| = 0 1b(m)1(}_ Ja(n - m))) 
= [lal], >> [6(m)| = lalla loll, 
The function e(n) = 1 when n = 0, e(n) = 0 when n # 0 is clearly in é)(Z). 
Note that a * e = e «a =a for all a and, further, e(@) = 1 for all 0 € T. 
Two functions f,g in C(T) can be multiplied. We simply set (f - g)(@) = 
f(9)g(@) for all 6 € T and observe that f -g € C(T). The Fourier transform, 


besides preserving sums and scalar multiples (Lemma 1), also preserves ”mul- 
tiplication.” More precisely: 


Lemma 4. For any a,b in @1(Z), S$(a* b) = S(a) - S(b). In another notation: 
axb=a-b. 


Proof. In the expression 


a* 2 * (8) = a > a(n — mon) be 


n 


we set k = n —™m to obtain 


1s { Sateen} etree en { Datoemee | 
k m k m 
= {Sate} 9» vimyine| = 4(0) - (8) 
k m 


To see how all of this might be useful in the study of Fourier series, consider 
the following result of N. Wiener: Suppose that f € C(T) is never zero, and 
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the Fourier series of f is absolutely convergent. Then the Fourier series of 3 ; is 
absolutely convergent. 

As one might expect, a proof of this result using classical methods is quite 
difficult. The reader may want to try to find one. Our discussion of the Fourier 
transform, however, enables us to reformulate the theorem as follows: 

It is clear that a function in C’(T) is in the range of & if, and only if, its 
Fourier series is absolutely convergent. Thus the hypothesis of Wiener’s theorem 
tells us that there is an a € @'(Z) such that a = f. Suppose now that we can 
find b € é'(Z) for which a * 6b = e. Then S(a*b) = a@-b= S(e) = 1; ie, 
a(6) (8) = f(@) - (6) = 1 for all 8. Because f(6) is never zero, this says that 
b= 1. Thus 4 7 isin the range of %, it is equal to b, and hence its Fourier series 
is absolutely convergent. 

In other words Wiener’s theorem, a theorem about Fourier series, is equiv- 
alent to the following: If a € @'(Z) and G is never zero, then there is a b € £!(Z) 
such that a * b = e; ie., if a € @'(Z) and @ never vanishes, then a has a “mul- 
tiplicative inverse” in @'. We shall investigate this result more fully in the next 
section. To begin, let us define our terms. 


Definition 2. An element a of @'(Z) is said to be invertible if there is an 
element 6 in @'(Z) such that a+b = b*a = e. We call b the (see Exercises 1, 
problem 2(e)) inverse of a and denote it by a7! 


It is clear that if a € é' has an inverse, then @ is never zero, because 
axa! = a(a—') = € = 1. What is difficult is showing that when @ is never 
zero, a has an inverse. 


We end this section with a proof that (€'(Z), ||-|],) is a Banach space; i.e., 
every Cauchy sequence of points in this space converges to a point of this space 
(Chapter Three, Section 2, Definition 3). 


Lemma 5. The space (é!(Z), || - ||,) is a complete normed space. 


Proof. Let {a"}°%_, be a Cauchy sequence in @'(Z) and, given € > 0, choose 
N so that 


lla" -a™hi <e 


whenever both m and n exceed N. Then for any fixed jp and any fixed k with 
jo < k, we have 


la"(Jo) -— a™(Jo)| < 3 la"(j) -a™(j)| < 3 ja"(j) -a™(j)| <e 


j=—k 


whenever both m and n exceed N. It follows that {a"(j)}92, is a Cauchy 
sequence in K for each fixed j. Because K is complete, this sequence converges. 
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Let b(j) = lim,—. a"(j) for each j € Z. Now things get a little bit messy. We 
let 
by = {--- ,0.b(—k), b(—& + 1),--- .b(0), B(1),--- ,b(A),0,--- } 
and we let, for each 72, 
k = {. a ,0,a"(-k), wie ,a"(0),- Pa ,a"(k), 0, cite } 


Then by and 6? are in ¢'(Z) and we have 


k 
NOella < [lox — Oeil + log Se+ S— la")! 


j=-k 


whenever n > N. If we fix n > N and let k tend to infinity we find that 


Y- lb) < e+ lla" lh 


— co 


showing that 6 = {b(j)}%, is in @'(Z). 


Finally, we shall show that lim,—. ||a” — 6||; = 0. As we saw above. for 
any fixed k, 


k 
dla") -a™(A) <e 
j=—k 
whienever both m and n exceed N. Letting 7m go to infinity we get 
k 
= la"(j) - WA) <e 
j=—k 
whenever n > N. Now choose, and fix, n > N and let & go to infinity. We find 
lla" — bl, Se 


whenever n > N. 


Exercises 1 


1. Let a € €'(Z) and, for a fixed m € Z, let a,,(n) = a(n — m) for all n. 
Show that @,,,(0) = e+"8a(@). 
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+2. For any a,b,c in é'(Z) and any \ € C show that: 


(a) axb=bxa: 

(b) A(a * b) = (Aa) *b = a (AD); 
(c) a*(b*c) = (ab) *¢; 

(d) ae(b+c) =atb+arc. 


(e) Show that the inverse of a, when it exists, is unique. 


«3. Let {b,} C é' and suppose that limb, = bo for the é!-norm. Show that 
lima * 6, = a * bo for any a € @'. If a € @! is fixed, show that the map 
T, : €' + @ defined by T,(b) = a + 6 is linear and continuous. 


4. Let c € ¢! be defined as follows: c(n) = 1 ifn = 0 or n = 1, and c(n) =0 
for all other n. 


(a) Show that (c *a)(n) = a(n) + a(n — 1), here a € é! is arbitrary. 
(b) Let cl! =c. c? =cx*c, .... Show that for any fixed m > 0 


rn! 


c™(n) = (0) = agen 
5. Define g € @'(Z) by letting g(1) = 1, g(n) = 0 forn £1. 


x(a) Let g! = g, g? = g*g, ... and show that for k fixed, g*(n) = 1 
when n = k and g*(n) = 0 when n $ k. 


«(b) Let g-!(n) = 1 when n = -1, g~!(n) =0 for n ¥ 1. Show that g™! 
is the inverse of g. 


«(c) Compute (g~!)* for each positive integer k. 


+(d) For any fixed a € é! show that a = °°. a(n) g” where convergence 
is for the ¢}-norm; here g® = e. 


(e) Given a,b in @! use the series representation of these elements given 
in (d) to find a series representation for a + b. 


(f) Use the representation given in (d) to show that to every continuous 
linear map y : é'(Z) — C there corresponds a unique sequence {cy} 
such that 


i. yp(a) = ancy for alla e€ é!; 
ii. sup{|c,| | 2 € Z} is bounded. 


6. If f € C(T) is never zero, show that ; is in C(T). 
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2. Invertible Elements in £'(Z) 


We recall that e(n) = 0 when n # 0, e(0) = 1, and an element a of @' is 
invertible if there is an element a~! € @! such that axa~' =a“! *a=e. 
For a € é', a £0, we set a° = e, a! =a and, for any integer k > 1, we set 


k —axaq*—!, 


a 


Theorem 1. Let a € £'(Z) and suppose that |la||,; < 1. Then the element 
(e — a) is invertible. Furthermore, if a # 0, 


(e-a) = Le’ 


where convergence is for the €!-norm. 


Proof. For each n > 1 let us set b, = 0 a) and consider the sequence 


{b,} ¢ é'. When m > n we have 


Ibm — bal =I Do @lh s do lle’ 


j=nt+l jg=n+l 


< > lla||2_ (Lemma 3 of Section 1 of this chapter) 


n+ 


< 3 llal|? < lal 07 ¥ 
1 1- lal 


n+l 


because the series of real numbers Lo \la||2 is geometric, and because |lal]; < 1, 
it converges to ;—jq7;. But, again because |lal]) < 1, lima—co |lalli*’ = 0 


showing that {b,} is a Cauchy sequence in (@', || - ||,). By Lemina 5 of Section 
1, this sequence converges to an element of £! that we shall denote by 


90 
de 
j=0 


We shall now show that this sum is the inverse of (e — a). We have 


(e —a) * (S:e') - (e-—a)* (2 lim | 


ee . _ ] . 
= tim | C a) * ye | (Exercises 1, problein 3) 


n n 
= lim ) a? — ) aJ*!!) lim (e —a"*') =e 
N—00 é n—~00 
4=0 j=0 
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By Exercises 1, problem 2(a), reversing the factors give the same result. 


Corollary 1. Let a € ¢'(Z) be invertible and suppose that b € @! satisfies 
lla — dll, < ler Then 6 is invertible. 


Proof. We have ||e—a7! +b||, = ||a~!*(a—)||, < |la7* |]: |]Ja—8||, < 1 by Lemma 
3 of Section 1 and our hypothesis. Thus, by Theorem 1, [e—(e—a~!*b)] = a~'*b 
is invertible. Call this c. So c = a~! * b and c is invertible. But then b = a«c, 
and this last convolution is invertible (its inverse is c~! « a~'). 


It follows from Corollary 1 that the set of all invertible elements of é' is open 
and hence that its complement, what is sometimes called the set of singular 
elements of @!, is a closed set. 


Definition 1. Recall that a linear map from ¢'(Z) into C is called a linear 
functional on @'(Z). A linear functional y, that is not identically zero, is called 
a multiplicative functional if y(a + b) = y(a) - y(b) for all a,b in é'(Z). 


If » is a multiplicative functional, then y(a) # 0 for some a. Consequently, 
y(a) = yp(a*e) = y(a)- y(e), and we see that y(e) must be one. Furthermore, 
if b is invertible, then y(b) # 0 and, in fact, p(b~!) = moe Theorem | tells us 
more about these functionals. 


Corollary 2. If y is a multiplicative functional on ¢'(Z), then for every a in 
this space we have |y(a)| < ||a||,. In particular, every multiplicative functional 
on ¢'(Z) is continuous on this space (Chapter Three, Section 2, Theorem 1). 


Proof. We argue by contradiction. Suppose that for some a we have |y(a)| > 
llall1. Set A = (a) and note that because ||%||, < 1, Theorem 1 tells us that 
(e — ¢) #0 is invertible. But then y (e — $) # 0 showing that because y is a 
linear map, y ($) # 1. This contradicts the definition of 4. 


Let us now choose, and fix, a multiplicative functional y on é' and consider 
M(y) = {a € é' | y(a) = 0}. The properties of M(y) are easily proved but 
worth listing. 


(a) M(¢) is closed. 


Suppose {a,} C M(y) and lima, = ap. Then lim y(a,) = (ao), and 
clearly this gives us y(ao) = 0; i.e., a9 € M(y). 


(b) M(y) is a linear subspace of @!, M(y) # @!, and if N is a linear subspace 
for which M(y) C N, then either M(y) = N or N = @!. 
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Because ¢ is linear, it is clear that M(y) is a linear subspace of ¢', and 
because ¢ is not identically zero, M(y) # @'; in fact, we may choose 
ao € é! such that y(apo) # 0. Now given b € é' set a = eh. Clearly 
p(b — aag) = 0 so b— aag = c E M(¢), and we see that b = c+ aag. Now 
suppose M(y) C N. If M(y) = N, we are done. If there isa b € N\M(y), 
then 6 = c + aap for some scalar a and some c € M. Because a # 0 and 
N is a linear subspace, we see that a9 € N. But then N = @! as claimed. 


(c) Ifa € M(p) and be é', then b*a € M(y). 
Because (b * a) = y(b) - y(a) this is obvious. 


Definition 2. A nonempty subset J of @'(Z) is said to be an ideal in é'(Z) 
if (a) J is a linear subspace; (b) whenever a € J and b € é'(Z), b*a is in J. An 
ideal J is called a proper ideal if J 4 é'(Z). Finally, a proper ideal is called a 
maximal ideal if the only ideals that contain it are itself and ¢'(Z). 


We observe that a proper ideal cannot contain e nor can it contain any 
invertible element. Thus if J is a proper ideal, then J is contained in the set of 
singular elements of ¢! (defined just before Definition 1). Because this last set 
is closed, we see that the closure (Chapter 3, Exercises 2, problem 2) of J is 
contained in it. We leave it to the reader to show that the closure of an ideal 
is an ideal and we have shown: 


Lemma 1. The closure of a proper ideal in @' is a proper ideal in @!. In 
particular, every maximal ideal in @! is closed. 


Let A be a nonempty family of nonempty sets. A subfamily NV of A is said 
to be a nest if for any sets A, Bin N either AC Bor BC A. A nest M is called 
a maximal nest if the conditions (i) NV is a nest and (ii) M C N together imply 
that M = N. So a maximal nest M is as large as possible; more precisely, if 
A ¢ M, then there must be a set B € M such that neither AC Bnor BC A 
is true. 

There is a useful equivalent form of the axiom of choice (Chapter Zero, 
Section 2) that was first proved by Hausdorff. Maximum Principle: Given any 
nest NV in A there is a maximal nest M in A such that VC M. 


Lemma 2. Every proper ideal in @'(Z) is contained in a maximal ideal. 


Proof. Let I be a proper ideal in @'(Z) and let Z be the family of all proper 
ideals in this set. Then {J} is a nest in ZJ. By the maximum principle there 
is a maximal nest M in JZ that contains {J}. It is convenient to index the 
ideals in M (Chapter Zero, Section 2, just after Definition 3). So we may write 
M = {I,|a € A}, and we know that because {J} C M, I = I, for some 
a € A. Now let J = Ue, Ja (Chapter Zero, Section 2, just after Definition 3). 
It is clear that J C J. We shall show that J is a maximal ideal. 
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Our first task is to show that J is a linear subspace of @1(Z). So given a,b 
in J we must show tliat any linear combination of these two vectors is in J. But 
we have a € J, and 6 € Ig for a,@ in A. Because M is a nest, either J, C Ig 
or Ig C [,. Let us suppose the former. Then a and 0 are in Ig, and because Ig 
is a linear subspace of @!(Z), any linear combination of a and b are in Ig. But 
Ig C J so we see that J is a linear subspace of @!(Z). 

Next we must show that for any a € J and any c € @)(Z), ac is in J. 
Now a € I, for some a and because J, is an ideal a ¥c is in this set. Thus 
axc€ I, C J; and we have shown that J is an ideal in @'(Z) that contains J. 

Now J = Uc, Ja and each I, is a proper ideal. Thus e ¢ I, all a € A, 
and so e ¢ J and we see that J is a proper ideal. 

Finally, we must show that J is a maximal ideal. Suppose J’ is a proper 
ideal in @'(Z) and suppose that J ¢ J’. Then M U {J’} is a nest in Z and 
MC MU {J'}. Because M is a maximal nest, we must have M = MU {J’}; 
thus J’ € M and so J’ = I, for some y € A. Clearly, we have reached a 
contradiction because I, C J. 

Our discussion above, see (b) and (c), shows that the null space of any 
multiplicative functional is a maximal ideal in é'(Z). Using some abstract al- 
gebra and some complex function theory (Liouville’s theorem), one can show 
that the converse is true. Because we are not supposing that our readers are 
familiar with these topics we shall only state: 


(*) Any proper closed ideal of £1(Z) is contained in the null space of a mul- 
tiplicative functional on é)(Z). 


Theorem 2. Given a maximal ideal M in é'(Z), there is a @ € T such that 
M = {a € @(Z) | @(8) = 0}. 


Proof. Let y be a multiplicative functional on @! such that M = {a € @! | 
(a) = 0}. Such a y exists by (*). Now recall the element g defined in Exercises 
1, problem 5: g(1) = 1, g(n) = 0 for n # 1. Because |y(a)| < |lal|, for every 
a € é' and because ||g/|) = 1, we see that A = y(g) must satisfy |A| < 1. But 
llg~"ll, = 1 so || < 1 also and we conclude that A = y(g) = et for some 
fixed 6 € T. 


Now for any a € £! we have 
ke k 
y(a) = (3 » ain) = jim » (3° any 
ke oo - 
= jim > a(n)p(g)” = Dee 


Thus y(a) is just @ evaluated at 0, and so M = {a | y(a) = 0} = {a € £' | 
a(@) = 0} as claimed. 
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Theorem (N. Wiener). If f € C(T) is never zero and if the Fourier series of 
f converges absolutely, then the Fourier series of + converges absolutely. 


Proof. We cannot claim to have here a complete proof because we simply stated 
(*) and it is crucial. We are only giving a sketch. As we have seen we can choose 


b € é' such that 6 = f and our hypothesis tells us that b is never zero on T. 
Suppose that b does not have an inverse in @', and let J = {a*b| ae @'}. 
Clearly, J is an ideal and because e ¢ J, it is a proper ideal in @'. Thus there 
is a multiplicative functional y such that J C M = {a € @! | y(a) = 0}. Then 
for some 9 € T we must have M = {a € @' | a(6) = 0}. But this says b(0) = 0, 
which is a contradiction. It follows that 6 is invertible in é}. ie., there is a 
b-) € é' such that b-! « b = e. Clearly, (b-! « b) = € or (b7!)- f =1. 


Exercises 2 
«1. For any fixed a € @'(Z), let I(a) = {a *b| bE é'(Z)}. 


(a) Show that J(a) is an ideal in @!(Z) that contains a. 
(b) Show that J(a) is a proper ideal if, and only if, a is not invertible. 
(c) Show that the closure of a proper ideal is a proper ideal. 
2. For any a € @!(Z), let o(a) = {A € C| (a — Ae) is not invertible}. We call 
a(a) the spectrum of a, and we call C\o(a) = p(a) the resolvent set of a. 


(a) Show that o(a) is a closed set by showing that p(a) is an open set. 


(b) Show that any A € C such that |A| > |la|l: is in p(a) and hence 
a(a) C {AE Ch A] < fal}. 


3. The Fourier Transform on R 


The theory here is very much like that for the case of Z except that the 
definitions, and the proofs of the theorems, involve subtleties not found in 
the discrete case. Let us begin informally, pursuing, as much as possible, the 
analogy with é!(Z). Because this space consists of all a : Z — C such that 
y~ ~~ _. Ja(n)| < 00; we “should” define L'(IR) to be 


n=-—oo 
(f:R—C| | [f(eldt < 00} 
Assuming that this is a vector space we set, for all f € L)(R), 


fll, = | ” 1p(t)lat 
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With a “proper” interpretation of the integral, we can show that (L!(R), |] - ||1) 
is a Banach space. (See Chapter Three, Exercises 5, problem 3(c) for a con- 
struction of this space.) The integral needed is that of Lebesgue, an extension 
of the classical integral of Riemann. We shall not assume a knowledge of this 
integral and so our proofs of the theorems stated below are only "sketches”; 
i.e., they are incomplete. 


For each f € L'(IR) we define 
Fic) = [ s(tyeae 


Observe that f is also defined for all real numbers. This is a problem, and 
it is wise to distinguish the domain of f, which is R, from that of f. We shall 
do so by agreeing that fi is defined on a “second” real line that we shall denote 
by R. 


For each a € f'(Z), @ is a continuous function on the unit circle; i.e., @ is 


an element of C’(T). Let C(R) be the vector space of all continuous functions 
on R. Then we have: 


Lemma 1. For each f € L'(R), f E C(R). Moreover, the map & that takes 
each f to f is linear and one-to-one. 


Proof. Let f € L'(R) be given. Then 


f(z a h) = f f (tet t+") ae = FE. e'"! F(t)e'* "dt 


—o 


Hence 
fle +h) - f(z) = / (e'* _ rel F(e)at 


and 


~ 


Fle +h)- fle < [ ” Lf (e)| fet — tat 


As h -+ 0 so does [e'”* — 1| and. one can show, so does the integral. Thus 
f © C(R). 

It is easy to see that 3 : L'(R) — C(R) is a linear map. To show that 
3 is one-to-one, we need only show that one can recapture f € L)(R) from 
a knowledge of its Fourier transform f. To do this one uses a kind of Cesaro 
convergence for integrals. We have 


R 
gt) = jim 5 (1 z *) Fla)e-***ax 
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where convergence is for the L'!-norm (compare this to Chapter One, Exercises 


6, problem 3). 


Note. When f happens to be integrable this formula gives 
= aes ~—izt 
st) = 5. | Fleye**taz 
which is valid in most practical applications. 
Lemma 2. For any f € L'(R), limyz)—.c0 f(z) = 0. Furthermore, 


max |3(f)(x)| = max |f(zx)| < |lflh 
rER zER 


Proof. The first statement is called the Riemann-Lebesgue theorem. It gener- 
alizes the theorem of Riemann given in Section 5 of Chapter 1 (Theorem 1). 
We have 


f(z) = | f(t)e*dt and — f(r) = / ~ et@(t+3) F(t)dt = | ” eit! f(t— =)dt 


-ac — 0 


Hence 


aiflayis | igit)- s(t *)lae 


For any f € L'(R) this last integral tends to zero a |z| — oo. 
We leave the second statement to the reader. 


Let us illustrate our results by calculating the Fourier transform of some 
specific functions. In these examples a is a fixed, positive, real number. 


(a) Let U(t) = 1 when t > 0, U(t) = 0 when t < 0. Set f(t) = e~*U(t). 
/ if (t)|dt = [ e-tat = + 
—0o 0 a 
so f is in L}(IR). Now 


fia) = | eu(neat = | ef-atiz)tdy 


— 00 0 
—— 1 | l 
= lim |———— - ——]| = —— 
M--co | —a+i2z —~a+ir a-iwz 


Note that jf is continuous and that lim)z)—..0 | fi (x)| = lim res = 0. 
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(b) Let 1,(¢) = 1 for |t] < a, 1,(t) = 0 for |t| > a. 


a taz —3az . 
pe elt — ¢ 2sinazr 
1,(z) — | e=! dt = Sa = 
a 1x r 


Again we note that the Fourier transform is continuous and tends to zero 
as |r| — oc. 


(c) Referring to example (b), let A(t) = (1 — {t|)1,(t). Because 4(—t) = k(t) 
we see that 


1 i 
k(x) -/ k(t) cosxtdt =2 | (1 — t)coszrtadt 
1 0 


= [==] ee =) 


x? x? 2 x/2 


(d) Consider the Abel kernel f(t) = e~@!4l. 


is oS 0 o.) 
f(z) - | eneettat = f eferisnars f e(—atiz)t yy 
—x 


= 0 
l e(atiz)M e(—atiz)N 1 
= lim —_—- —- ——| + lim | ———— - - 
M—-o |at+iz a+itz N—oc| —Q+22zr —a+iz 
= 2a 
~ at +2? 


(e) Here we simply note that when f is differentiable we have 


= fiz) = < / - f(t)e**dt = / 


oo) 


(it f(t)je'**dt 
oo 
which is the Fourier transform of 7 t f(t). More generally, le f(z) is the 


Fourier transform of (it)" f(t). 


In the space é'(Z) we found a kind of multiplication. There is an analogous 
operation here. 


Definition 1. For any f, g € L'(R) we define 
(f+ o)(e)= [| f(e-v)aluddy 


Lemma 3. For any f,g in L'(R) the function f +g is in L'(R) and || f +l, < 
Ifa flglh. 
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Proof. 
Fea = [isteovaride = [1 f 2 - watwaviae 
< fae [is(e-vllowlev = [ lowley f Ire - was 
=o [ (ste v)lde = Hol 


assuming the interchange in the order of integration is valid. 


Lemma 4. For any f,g in L'(R) we have 3(f «g) = 3(f)S3(g) or, in another 
notation, (f *g) = f -g. 


Proof. (Ff *9)(z) = [(f *g)(t)e**dt = f e'** (f f(t — u)g(u)du) dt setting w = 
t — u this becomes 


[esermdu [ 00)g(updw = [etoluyau fe f(w)dw 
= G(z) - f(z) 
A typical application of the Fourier transform is to solve differential equa- 
tions. As an illustration of this consider 
dus 
~The + a*u(t) = f(t) 
where f(t) is assumed known. Applying 9 we get 
$|-—> + au = —(iz)?i + ai = (rc? +0?)a =f 
Thus 
Be ] ~ ape 
ite) = (spa) fle) = 9@)F@ 
where, from (d), 


l 
t) = —ealtl 
g(t) 50° 


It follows, then, from Lemma 4 that 


u(t) = = 


alt} co 
ath = ge f etls(eas 
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Exercises 3 
1. Let f(t) = e2**1,(t). Compute f(z). 


2. Let f € L}(IR) and let a,h be real numbers. Show that 


(a) Fora #0, Sf (at)] = F(Z); 

(b) S[f(t — h)] = et** f(z); 

(c) Sfe'** f(t)] = f(x +a); 

(d) 9{(cos at) f(t)] = 3 f(z - a) + 3 f(z +a). 


3. For any f,g,h in L}(R) and any 4 € C show that 


(a) feg=grf; 

(b) A(f *9) = (CAS) *9 =f * (Ag); 
(c) f *(g*h) = (f *g) ¥h; 

(d) fe(gth)=fregt fh. 


«4. Show that there is no function e(t) € L'(R) such that e+ f = fxe=f 
for all f € L'(IR). Hint: If e exists, then e * e = e. Now apply the Fourier 
transform. 


4. Naive Group Theory 


We can get a deeper understanding of the Fourier transform by using some ideas 
from modern algebra. These are usually discussed abstractly, but, we believe, 
it may be better to begin with some concrete examples. 


Definition 1. A nonempty subset A of C is said to be an additive group if 
(a — b)€A whenever both a and 0 are in A. 


It is clear that C,R,Q,Z and even {0} are additive groups. The last of 
these is called the trivial group. We might also recall that, for any function 
f :R—K, the set P(f) is an additive group (Chapter One, Section 2, Lemma 
1). It is a nontrivial group when, and only when, the function f is periodic. 
Many of the properties of P(f) are shared by any additive group. 


Lemma 1. Let A be an additive group. Then (i) 0¢A; (ii) for any aeA the 
number (—a) is in A; (iii) when a and 0 are in A so is (a + 0); (iv) for any aeA 
the set {na | neZ} is contained in A. 
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Proof. We may assume that A is nontrivial. Then for aeA we must have (a — a) 
in A, which proves (i). Now that we know 0¢A, (0 — a) is in A for every aeA, 
proving (ii). To prove (iii) we note that if a,b are in A, then a, —b are in A, and 
so a — (—5) is in A. 

Finally, let aeA and suppose that na is not in A for some positive integer 
n. Let no be the smallest positive integer such that noa ¢ A.Then by (iii) we 
see that a+a = 2aeA; hence no > 2. It follows that (ng — 1) aeA because of the 
way No was defined. But, then, again using (iii), (m9 — 1)a + a = nga is in A, 
which is a contradiction. We have shown that naeA for every positive integer 
n. By (i), however, 0-a is in A, and by (ii) (-na) = (—n) a is in A for every n. 
Thus {na | neZ} C A as claimed. 


When A, B are additive groups and B C A we shall say that B is a subgroup 
of A. In particular, when aeA the set {na | neZ}, which is clearly an additive 
group, is a subgroup of A. 


Clearly, every additive group is a subgroup of C, but it is often useful to 
specify that we are working with a subgroup of R or a subgroup of Z. 


Corollary 1. Let A be a subgroup of the additive group R. If A contains a 
smallest positive member, say ag, then A = {nao | neZ}. 


Proof. We have seen that {nag | neZ} C A. Given beA suppose, first, that 6 is 
positive. Then ag < 6b, and we nay write b = kag+r where k is a positive integer 
and 0 < r < ao. Because both 6b and kag are in A, it follows that reA and hence 
that r = 0. Thus be {nag | neZ}.When b is negative, (—b) is a positive member 
of A and so (—b) = kag for some integer k. But then b = (—k) ap and we are 
done. 


Corollary 2. If A is a subgroup of the additive group Z, then either A = {0} 
or A = {né | neZ} where @ is the smallest positive member of A. 


Definition 2. A nonempty subset M of C is said to be a multiplicative group 
if zy~'e«M whenever both z and y are in M. 


It is clear that C\ {0}, R\ {0}, and Q\ {0} are multiplicative groups. So 
are the sets {1}, {1,—1}, and T (the unit circle). We have an analogue of 
Lemma 1. 


Lemma 2. Let M be a multiplicative group. Then (i) 1«M; (ii) for any reM 
the number x7! is in M; (iii) when z and y are in M so is ry; (iv) for any reM 
the set {z” | neZ} is contained in M. 


When M,N are multiplicative groups and N C M, we shall say that N is 
a subgroup of M. In particular, when zeM the set {2" | neZ}, which is clearly 
a multiplicative group, is a subgroup of M. 
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It is now necessary to specify the arithmetic operation under which a given 
set is a group. Unless it is clear, from the context, what operation we mean, we 
shall denote an additive group by (A, +) and a multiplicative group by (M,-). 


The exponential function, f (x) = e*, maps R onto the positive real num- 
bers. The set of all positive real numbers, let us denote this by R,, is clearly 
a multiplicative group. Thus f maps (R,+) onto (R;,-) and, furthermore, 
f(x+y) =f (x)-f (y) for all z,y in R. This particular function is, of course, 
both one-to-one and onto. Let us look at this kind of inapping in a slightly 
more general context. 


Let (A, +) be an additive group, let (M,-) be a multiplicative group, and 
let f: A— M be a function such that f (a + 6) = f (a) f (6) for all a,b in A. 


Because 0 ¢ M and f (a) = f (a+ 0) = f (a) - f (0), we see that f (0) = 1. 
But then 1 = f (0) = f(a+(-—a)) = f (a) f (—a), giving us the fact that 
f(-a) = f(a)~’. Using these observations we can show that the range of f 
is a subgroup of (Af,-). To do this suppose x,y elements in the range of f are 
given. We inust show that ry! is in the range of f. But we have a,b in A such 
that f (a) = x and f (b) = y. Thus (a — b)€A, because A is an additive group, 
and f (a — b) = f (a + (—b)] = f (a) f (—b) = zy™!, proving our claim. 


Let us now show that {aeA| f(a) = 1} is a subgroup of (A,+). Given 
a,b in this set (i.e., given a,b in A such that f(a) = 1 = f (b)) we must 
show that a — 6 is in a set (i.e., we must show that f(a—6) = 1). But 
f(a—b) =f [a+ (—6)] = f (a) f (-b) = f (a) f (b)~' =1 and we are finished. 


Exercises 4 
* 1. Prove Lemma 2. 


*2. The function f (z) = Inz maps R,; onto R. Moreover, for any z,y in Ry 
we have f (ry) = f (x) +f (y). This particular function is one-to-one and 
onto. Now let (M,-) be a multiplicative group, let (A,+) be an additive 
group, and let f: M — A satisfy f (ry) = f (x) + f (y) for all z,y in M. 


(a) Show that the range of f is a subgroup of (A, +). 

(b) Show that {xeM | f (x) = 0} is a subgroup of (M,-). 

(c) Show that U, = {+1,+7} is a multiplicative group. 

(d) Let f : Z — Us be defined as follows: f (n) = i" for every n. Show 
that f (m+n) = f (m) f (n) for all m,n in Z. Identify the range of 
f and {neZ | f (rn) = 1}. 

(e) Let g : Z — Us be defined as follows: g(n) = (—1)” for every n. 
Show that g(m+n) = g(m)g(n) for all m,n in Z. Identify the 
range of g and {neZ | g(n) = +1}. 
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+3. Let n be a fixed positive integer. Let U, = {zeC | z” = 1}. 


(a) Show that U,, is a multiplicative group; in fact, it is a subgroup of 
T. 

(b) Let p(z) be a given polynomial. The complex number r is a root 
of the equation p(x) = 0 if, and only if, p(r) = 0. We recall that 
r is a root of the equation p(z) = 0 if, and only if, z —r is a 
factor of the polynomial p(z). The largest positive integer / such 
that (x — r)* is a factor of p(z) is called the multiplicity of the root 
r. A root of multiplicity one is called a simple root. Show that any 
root of p(x) = 0 that has multiplicity greater than one is also a root 
of p’ (xz) = 0 (the derivative of p(x)). Conclude that every root of 
x" —1 = 0 is simple and hence that U,, contains n distinct nuinbers. 


4. Let Q(V2) = {a + bV2 | a,b are in Q}. Clearly, Q (V2) CR. 


(a) Show that Q (V2) is a subgroup of (R, +). 
(b) Show that Q (/2) \ {0} is a subgroup of (R\ {0},-). 


+5. When (M,-) is a multiplicative group and reM, the set {x" | neZ} C M, 
as we have seen. If there is an zeM such that {x" | neZ} = M, then 
we say that (/,-) is a cyclic group and we say that z is a generator of 


(a) Show that (U4,-) is cyclic and find all of its generators. 

(b) Show that (U,,,-) is a cyclic group and identify a generator. Hint: 
Every element of U,, is an element of T and hence can be written as 
e%F, 

(c) An additive group (A, +) is said to be cyclic if there is an aeA such 
that A = {na | neZ}. We have seen that any subgroup of Z is cyclic. 
Is Z cyclic? 
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We have seen that for any f : R — K, the set P(f) = {aeR | f(z + a) = f (x) 
for all x} is an additive group. Observe that for real numbers z,y we have 
f(z) = f(y) when z — y is in P(f); for then 2 — y = aeP(f) and f(z) = 
f(y+a) = f(y). Thus f is constant on certain subsets of R. We can get a 
better understanding of the structure of these sets by making use of the group 
theory we discussed in the last section. 


Definition 1. Let A be an additive group and let C be a subgroup of A. We 
shall say that the elements a,b of A are congruent modulo C, and we shall 
write a = bmodC, if (a — b) €C. 
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We shall give many examples of this very soon. For now we observe that 
congruence modulo C’ gives us a relation on the set A; it is in fact, as we shall 
now show, an equivalence relation (Chapter Zero, Section 2). 


Lemma 1. For any additive group A and any subgroup C of A, congruence 
modulo C is an equivalence relation on A. 


Proof. We must show that this relation is reflexive, symmetric, and transitive. 
To prove that it is reflexive we must show that for any aeA, a =a modC; i.e., 
we inust show that for any aeA, (a — a) is in C. But this is immediate because 
a —a=0 and 0 is in any subgroup of A (Section 4, Lemma 1, (i)). 

Next we niust show that for any a,b € A such that a = bmodC, we 
also have b = a modC. Again this is immediate because (a — b) € C and so 
— (a — b) = b—ais in C by Lemma 1. (ii), of Section 4. 

Finally, suppose that a,b,c are in A and a = b modC, b = cmodC. We 
must show that a = c modC. The first of these gives us (a — b) € C, and 
the second gives (b-c) € C. By Lemma 1, (iii), of Section 4, we may add 
two elements of C’. Thus (a — 6) + (b — c) = (a —c) € C. This, of course, says 
a=cmodC. 


For any aeA, [a] = {beA|a=b modC} is called the equivalence class 
containing a (Chapter Zero, Exercises 2, problem 1). The set of equivalence 
classes is denoted by A/C; read ” A mod C.” 


We began our discussion by noting that for any f : R — K, the set P(f) 
is a subgroup of the additive group R and that f(z) = f (y) whenever z = 
y modP(f). Thus f is constant on the equivalence classes of R; i.e., f is 
defined on the eleinents of R/P(f). 

As an illustration of this, suppose f has smallest positive period 27. Because 
P(f) is then {(27) n | neZ}, we see that r = y mod P(f) simply means r—y = 
(27) k for some integer k. But any real number z can be written as x = (27) k+r 
where & is an integer and 0 < r < 27. Thus each equivalence class contains 
a number (and clearly only one such number) from the set (0,27). So we may 
regard f as being defined on this last set. This is, of course, just another way 
of saying that f is a function on the unit circle. 


For subgroups of the additive group Z the construction of the equivalence 
classes is simple and illuminating. To be specific, we take A = Z and C = 
{6n | neZ}. Then a = b modC means a — 6 is a multiple of six. Now for 
any aeZ we may write a = 6n +1, where n is an integer, r is an integer, 
and 0 < r < 6. Thus any aeZ is equivalent to one, and clearly only one, of 
the numbers 0, 1,2,3,4,5. Thus A/C = {[0], [1] , [2] , [3], [4], [5)}, where (0) = 
{neZ | n = OmodC}.... 

The choice of six is simply one of convenience. We should note here that 
C = {6n | neZ} is usually denoted by (6) and the set A/C = Z/ (6) is often 
denoted by Z,g. Furthermore, a = 6 mod C is often written a = 6 mod6. 
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Given any snbgroup C of the additive group Z, we have seen that C' = 
{né | neZ} where ¢ is the smallest positive member of C (Corollary 2 to Lemma 
1 of Section 4). We shall denote this subgroup by (@). Note that Z/C = Z/ (¢@) 
consists of the @ equivalence classes [0], {1],...,[@— 1]. This set is denoted by 
Ze. 


Returning now to the case of an additive group A and a subgroup C of A, 
let us note that there is a natural “addition” on the elements of A/C’. Given 
[a] , [5] in this set we choose an element of [a], let us take a, and an element of 
[6], take 6, form their sum in A and set [a] + [b] equal to the equivalence class 
containing a + 6. To show that this is well defined, we must show that if a’e (a) 
and b’e(b), then a’ + b’ and a + 6b are in the same equivalence class; i.e., we 
must show that (a + 6) = (a’ +6’) modC. This is very easy. Because a’e [a] 
and b’¢ ([b] we have a — a’ in C and b — 0’ in C. Thus (a — a’) + (b6— 8’) isin C 
(Section 4, Lemma 1, (iii)), which says (a + 6) = (a’ + 0’) modC. 


Let us illustrate this operation for the case of Zg = Z/ (6). To add [3] and 
[4] we take any element of [3], let’s take 3 (you can take 45 if you prefer). and 
any element of [4], we'll take 4 but you can take 52 if you wish, and add them. 
So, because 3 + 4 = 7, [3] + [4] = [7] = [1] because 7 = 1 mod6 (note that 
45 + 52 = 97 so [3] + [4] = [97], but 97 = 1 mod6). 

Following this pattern we can construct an "addition table” for Z,. It is 
customary, and convenient, to denote the elements of this set by 0, 1, 2,3, 4, 5. 
We have: 


We should stress that the elements of Zg are not complex numbers and the 
operation + is not ordinary addition. The table shows that 0 is the identity 
element of Zg. We mean a+ 0 = 0+ a = a2 for every aeZg. Furthermore. given 
any aeZz there is a beZg such that a+ b= 0. 

We call b the additive inverse of a. So we can define a kind of subtraction: 
a — b is taken to mean a plus the additive inverse of 6. For example, 3 — 2 is 
to mean 3 + 4 because 4 is the additive inverse of 2. Thus 3 — 2 is 1 while 
3$-4=34+2=5. 

There is a natural map Y.6 : Z — Zg. For any neZ we take ye (n) to be the 
remainder one gets when n is divided by 6. So ¥g (12) = 0, v6 (11) = 5. Ye (7) = 
1, etc. Observe that, for any m,n in Z, yg (m+n) = ye (m) + v6 (72). 


134 Chapter Four # The Fourier Transform 


We warn the reader that in writing m +n we are using ordinary addition, 
whereas in writing yg (m) + vg (n) we are using the operation defined in Zg. 
Interpreting this sum as an ordinary addition leads to erroneous results. 


Exercises 5 


x1. Let M be a multiplicative group and let N be a subgroup of 7. 


(a) For any s,t in M define s = tmodN to mean st~'eN. Show that 
congruence modulo N is an equivalence relation on M. 


(b) The set of equivalence classes of M under the relation defined in (a) 
is denoted by M/N. For any {s], [t] in M/N define [s]  [t] as follows: 
choose se |s], te [t] and set (s] » [t] = [st]. Show that this operation 
is well defined; i.e., if s’e[s] and t’e [t] show that s’t’ = st mod N. 

(c) Show that the element [1]¢A//N is a multiplicative identity for this 
set; i.e., show that [s] * [1] = [1] * [s] = (s] for all [s] in M/N. 

(d) For each (s]«M/N show that there is a [t] «M/N such that [(s] * [t] = 
(1). 


(e) Let M be the multiplicative group of all nonzero complex numbers 
and let N = T (the unit circle). Identify the elements of M/N. 


2. Compute the addition table for Z 9. Define yg (n) to be the remainder 
one gets when n is divided by 9. Show that when n > 0, yo (n) can be 
computed as follows: Add the digits of n. If the result is greater than 9, 
add its digits. Continue until you get a number between 0 and 9. 


3. (a) We refer to problem 2. Show that for all m,n in Z, yg(m+n) = 
Y9 (m) + Yo (7). 
(b) Compute ¢9 (2315) + yo (7489). Show that this is yg (9804). 
(c) Find {neZ | yg (n) = 0}. 


*4. Let A be an additive group and let C’ be a subgroup of A. Define y: A — 
A/C as follows: y (a) = [a] for every aeA. 


(a) Show that y(a + 6) = y(a) + 9 (0) for all a, be A. 
(b) What is {aeA | y(a) = [0}}? 
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6. Finite Fourier Transform 


The Fourier transform plays a fundamental role in modern digital signal pro- 
cessing (see Notes). This application requires the handling of enormous amounts 
of numerical data. I have been told that a color TV picture, for example, re- 
quires tlhe processing of abont 8 million bits of data per second. What is clearly 
needed lhiere is an efficient numerical technique for computing the Fourier trans- 
forms involved because, in most cases, they cannot be found analytically. The 
fast Fourier transform algorithm is such a technique. Its efficacy is remark- 
able. In some cases 1 billion coniputations can be replaced by 1 million, one 
one-thousandth of the original number. What one does is replace the Fourier 
transform of a function defined on R by the Fourier transform of a function 
defined on Z, for large 2. The algorithms are then applied to this replacement. 
We shall concern ourselves witl: the problem of how the Fourier transform can 
be defined on Z,. Once we have done that, we investigate its properties and 
give some applications. 
In the case of the additive group Z we considered 


O(a) = {a2 C13 1a(n) coo} 


—- 8 


to @) 
For a,b in €!(Z) we defined (a * b) (n) = yo a(n — m)b(m) and note we need 
—00 
here the group operation in Z; i.e., n - m = n+ (-—m) where (—m) is the 
additive inverse of m. We have seen that subtraction is possible in Z, and so 
we can define convolution for functions on this set. Let us do that and work 
out a siinple case in detail. To begin with let 


t-1 
d= fa %—C1Y Lal) coo] 


n=0 


Clearly, every function on Z, is in @! (Z;), so our notation is a little silly. We 
use it to emphasize the analogy between whliat we are doing here and what we 
did on Z. 


Definition 1. For any a, bel! (Z;) we define the convolution of a and 6, a +b, 
(-1 


as follows: (a * b)(k) = }_ a(k — m)b(m). 


m=0 


Let us stress that the subtraction is to take place in Z,. To get a “feeling” 
for this operation, let us work out the convolution of two elements a, b in @! (Z3). 


136 Chapter Four s The Fourier Transform 


We have: 


2 
(a *b)(0) = 5 a(0 — J) b((j) = @(0 — 0) b(0) + a(0 — 1) b(1) + a (0 — 2) (2) 
37=0 
= a(0)b(0) + a(2)b(1) +.a(1)6(2) 


because 0 — 1 is the additive inverse of 1 in Z3 and this is 2, while the additive 
inverse of 2 is 1. Similarly, 


2 
(a * b) (1) =) a(l — j)b(j) = a(1 — 0) b(0) + a(1 — 1) b(1) +a (1 — 2) (2) 
—0 


= a(1)6(0) + a(0)6(1) + a(2) b(2) 


because, in Z3, 1 —-2=1+1=2. 
2 

(a «b)(2) = $° a(2 - j) b(j) = a(2 - 0) (0) + a(2 — 1) b(1) + a(2 — 2)b(2) 
j-0 


= a(2)b(0) + a(1)6(1) + a (0) 6(2) 


Observe that if we write a and b as (ay, a), a2) and (bo, b;, 62), respectively, 
then a * b = (agbo + a2b, + a,b, a)bp + agb; + a2b2, agbp + a,b; + abe). 


This last expression clearly shows that @! (Z3) contains an identity. The 
function e = (1,0,0) satisfies a «e = e xa = a for every a. 


This operation is sometimes referred to as “cyclic convolution.” Its basic 
properties are easily established. 


Lemma 1. For any a,b,c in @' (Z;) we have 
(a) at b= bea; 
(b) a«(b*c) = (a*b) *c; 
(c) a(a*b) = (aa) *b = a (ab) for any aeC; 
(d) ax(b+c)=aeb+arc; 
(e) There is an element ee?! (Ze) such that a *e = e «a = aa for every a. 


The Fourier transform on Z takes each function in @! (Z) to a continuous 
function on T; the unit circle in the complex plane. Recall that for aeé! (Z), 
a(80) = 5% a(n) e'”®. Let us examine, a little more closely, e'"® for @ fixed. 
We can define cg : Z — T by setting cg (n) = e'”® for each neZ; remember 8@ is 
fixed. Observe that co (m +n) = e™*")9 = eM. e129 — ¢g (m) - cg (n) for all 
m,n in Z. Maps like this have a name. 
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Definition 2. A mape: Z — T such that c(m +n) = c(m)-c¢(n) for all 
m,neZ is called a character of Z. 


Lemma 2. To every character c of Z there corresponds a unique real number 
6¢(0, 277) such that c(n) = (@%?)" for every neZ. 


Proof. Because ¢(m) is never zero and ¢(n +0) = ¢(n)-c (0), we see that ¢(0) = 
1. Furthermore, ¢(n) = c(1 + 14...+ 1) =c(1)" for every natural number n. 
Combining these observations we have 1 = c(0) = ¢(n + (—n)) = e(n) -¢(—n) 
and so c(n) = ¢(1)" for every integer n. 

Now c(1) € 7 and so c(1) = e” for some real number y. If y ¢ (0,27) 
we may write y = (27)k + @ where & is an integer and @ € (0,27). Clearly. 
then ¢(n) = (etv)" = cN2krt+O)n — _t2kne (e'8)" = (ef)" for every n € Z. This 
proves the Icinma because it is clear that no two 6's in (0, 27) can give the same 
character of Z. 


Corollary 1. There is a one-to-one correspondence between the characters of 
Z and the points of T. 


Proof. Because there is a one-to-one correspondence between the points of T 
and the set (0,27), this follows immediately from our lemma. 


For aeé!(Z), @(8) = o* a(n) e'"®. If we think of @ as fixed, then we 
are taking the character e’? and applying it to each n. cg(n) = e'"?, then 
multiplying by a(n) and finally summing over the domain of a; i.e., summing 
over Z. We will do something similar for functions in f! (Z,). To begin. however, 
we must first identify the "characters of Z,.” 


Definition 3. Let f be a natural number and consider the set Z,;. A map 
+: Z¢ — T is called a character of Z; if y(m+n) = y(m)-7(n) for all mn 
in Z¢. 


Lemma 3. There is a one-to-one correspondence between the characters of 
Z, and the solutions to the equation 2! = 1. 


Proof. Let w be a complex mamber such that uf = 1. Define a map 7: Z;) — T 
by setting 7 (k) = w* for each keZ,. Clearly, y(n +n) = wt" = ww" = 
y (mm) > (71) for any m,n in Z;. Furthermore, ifm = nmod f, then ¥ (mm) = 4 (7). 
so + is well defined and is clearly a character of Z. 

Suppose now that y is a given character of Z,s. Then becanse 
y(k+0) = y(k)y7(O) we see that 7(0) = 1. Also. for any keZ, we have 
y(k) = 7(1+...4+1) = + (1)*. Thus 7 is completely determined by y (1). 
But 7 (@) = 7 (0) = + (1)! = | and so (1) is a root of the equation s£ = 1. 
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Definition 4. Let € be a natural number. The set of all solutions to the 
equation z‘ = 1 is denoted by U;. The elements of this set are called éth roots 
of unity. 


We have seen (Exercises 4, problem 3) that U, is a multiplicative group 
containing elements. 
Let us look more closely at U, and its multiplication table: 


It is, of course, true that 14 = 1, but 4 is not the smallest positive integer 
for which this is true. Clearly, 1! = 1; we say 1 has order 1. Similarly, (-1)4 = 1, 
but the smallest positive integer for which this is true is 2; we say (—1) has 
order 2. The remaining elements i and —: each has order 4; 4 is the smallest 
positive integer n such that :” = 1 and it is the smallest positive integer n each 
that (-i)” = 1. 

Note that {i” | neZ} is just {1, —1,i, —i}; ie., it coincides with U,. Thus 
U, is a cyclic group and : is a generator of this group (Exercises 4, problem 5). 
Clearly, —i is also a generator of this group. 


Definition 5. Let @ be a natural number and let aeU¢. The order of a, 6 (a), 
is the smallest natural number k such that a* = 1. When 6 (a) = @ we say that 
a is a primitive @th root of unity. 


Clearly, e?*'/ = cos 24 + isin 3 is a primitive éth root. When ¢ = 3, this 
gives us cos 2% +isin 2% = —; +i¥3, When ¢ = 4, we get cos 2% + isin 2% = 1. 
Lemma 4. The order of each element of U; is a divisor of @. An element w of 
U; is a primitive @th root if, and only if, it generates the group U,. 


Proof. If aeU, and 6 (a) = k, then clearly k < @. Now @ = qk +r where q and 
r are integers and 0 <r < k. Thus 1 = af = a%+" = (a*)* a" = (1)'a" =a", 
which says, because k is the smallest natural number such that a* = 1, that 
r= 0. 

Now suppose that weU, has order @. Then Uy D {w' |l1<j< e} so to show 
that these sets are equal we need only show that the last one contains @ distinct 
members. But if w? = w? for 1 < p<q< @, then w?-? = 1 andO0<q-p< @. 
This contradicts the fact that w has order @. Thus w generates U,. 
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Finally, let w be a generator of U;. If @(w) = k < €, then{u" | jeZ} = 
{w!.w?,w,...,w*}, and this contains k < f elements. So the order of u: must 
be @, 

Definition 6. Let é be a fixed natural number and let w be the primitive (th 
root ¢2"/€, For any acl! (Z-) we define @ (w*) = 5-°-) a(j) (w*)’ and call this 
function, its domain is Us, the Fourier transform of a. 


For the case of a function aeé! (Z3) this becomes 


2 
& (w°) = S> a(j) (w°)? = a (0) +0(1) + (2) 
j=0 
2 
a(w) = S-a(j) w =a(0)+a(1)w+a(2)w? 
j=0 


2 
a(w?) = 5 a(j) (w?)” = (0) +.a(1) wu? + a(2) w4 
)=0 


A more convenient way to write this is 


a (w°) 11 1 a (0) 
a (wu) =| 1 ww? a(1) 
a (w?) 1 w* w’ a (2) 


4 


201/3¢U,, wt = w. 


Of course, because u' = € 

Recall that the map F takes ¢' (Z) into C (T). So if aeé!(Z), F(a) = 4 is 
defined on T. Furthermore. F is linear, invertible. and “preserves” convolution. 
The map F; takes each function acé! (Z~) and transforms it into a function 
@ on U;. However, it is customary, or at least very conmmon, to regard a as 
also having domain Z;. This is done as follows: Recall ¥(k) = u* is a map 
from Z,- to U; that satisfies y(A +m) = 7(k)¥(m) all km in Ze. When w 
is primitive, as it is in this discussion, y is both one-to-one and onto. Thus 
we may identify (Z¢.+) with (U;.-). With this understanding @(w®) = @(0). 
a(w) = @(1),....4(w'") =a(- 1). 


Onr Fourier transform may now be written F;(a)(k) = a(k) = 
f-1 fed aes 
> a( j) (w*)? = >= a( j) (c#*) in matrix notation: 
}=0 2=0 
a (0) 1 1 ! A a (0) 
a(1) J 1 ow u o = =6wel a(1) 


? 


a(@-1) 1 (w'") (wf)? (wi!) a(t ~1) 
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Theorem 1. For each fixed @ the map Ff, is linear and preserves convolution; 
i.e., Fy (a *b) = Fe (a) - Fe (db). 


Proof. 


k=0 k=0 | j=0 


é-1 é—1 | é~1 
Fy (a b)(n) = > (a*b)(k)(w")* = >> Ss (j) b(k - i (w")* 


- Yau) pau 5) (w") ‘ 


k=0 


e~ 
= [Earn pau j) (w") i 


j=0 


= Fr (a) (n) - Fe (b) (n) 


Let us set F,' (a) = pea a(j) (@ ky? Then 


¢-1 . 
Theorem 2. For any acl! (Z;) we have @(k) = F(a) (k) = >_ (3) (w*)? 
e=1 
and a(k) = Fy ' (4) (k) = 4 5 a(j) (w*)’ 


j=0 


Proof. In this last sum we replace @(j) by its definition obtaining 


7800" y=7L (Lom (w’) ") ry 


n=0 
7 Se bat oy 
n=0 
When n = & this inner sum is @, and when n # k it is zero (Exercises 6, 
problem 1). 
Thus 


ee 


72! G(j) (w*)" = a(k) 
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We have seen that the Fourier transform on Z,, the finite Fourier transform, 
maps functions defined on Z, to functions defined on the set of characters of Z,, 
the set Zi. Now Zz, can be identified with Ue, and this last set can be identified 
with Z,. So the transform 9¢ maps functions defined on Z,¢ to functions defined 
on Ze. The transform is linear and so in this, finite, case it can be represented 
by a matrix. Moreover, the transform preserves convolution and is invertible. 

We shall see (Exercises 6, problern 1) that when w € U; (i.e., when w is 
an fth root. of unity) and w # 1, then ae w) = 0. Now suppose that w is a 
primitive @th root of unity. Then for any m 0O<m . €—1,w"™ is in U, and, 
because w is primitive. w™ #4 1. Hence Be Zo(w™)? = 


Let us work out the case ¢ = 3. Recall: 


1 we wi 
where, of course, w* = w. Consider now 
11 1 1 1 1 3 lt+o+o? 1+0?7+u4 
lw wltl o wo] =] 1l+w+u? 3 l+o+u? 
1 w? wtf \l wo ot L+w?tw? l+wt+w? 3 


we have used the fact that w = 1/w (Exercises 6, problem 2(b)). 
This last matrix is just 


and, it is easy to see, reversing the factors gives us the same result. Thus to 
invert 933 we take its matrix and replace each entry by its complex conjugate 
and multiply the result by 4. In another notation: 


Sa(f)(k) = F(R) = DS Funtet = Yo sete 


=0 j= 
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and 
351 (9)(k) = j yaiary = Yaa ag yk 
j=0 


Remark on Notation. Some writers define the Fourier transform in a 
slightly different way to obtain a symmetry between the transform and its 
inverse. They set 


S2(f) = f(k) = Yt: “y F(j)(w*)? 
and obtain 


$7 1(9) = 4S atayaty 


ve 
This is also done in connection with the Fourier transform on R. One writes 
~ 1 o0 iit 
iS = = / t)e™***dt 
(N=Fz)= Fe] se 
so that 


371(9) = =e | “_o(a)e"**dz 


We see little advantage in doing this, but the reader should be aware that it 
is not uncommon. 


Exercises 6 


«1. Let w € Ue, w 1. One of the properties of éth roots of unity that helps 
make the fast Fourier transform algorithm so efficient is the following: 
a w) = 0. Follow the steps below to give two proofs of this. 


(a) Set r = S>5hu/. Show that wr = 7 and hence, if r # 0, we have a 
contradiction. 


(b) Factor w” — 1. 


*2. Let w € Us. Show that (a) w-! = w'-!; (b) & = 1/w; (c) if 2 is a prime 
(an integer larger than one that has no factors other than itself and one), 
then every element of Uz except +1 is a primitive éth root of unity. 


7. An Application 143 


3. Show that ¢'(Z,) contains an element e such that e « f = f for every 
f € '(Z,). Compute 9;(e). An element f of €'(Z,;) is said to be invertible 
if there is an element g € @'(Z,) such that f *g =e. 


(a) Show that f is invertible if, and only if, S¢(f) is never zero. 

(b) If f is not invertible find all g such that f * g = 0. 

(c) f is called a divisor of zero if there is g # 0 such that f *«g = 0. 
Show that f is a divisor of zero if, and only if, f is not invertible. 


(d) If f is invertible we have an element f~! € ¢'(Z,) such that fx f-! = 
e. Show that f~! is unique and that 9,(f~!)(”) = 1/S¢(f)(n) for 
every 7 € Zy. 


4. Write the Fourier transform on ¢'(Z2) in matrix form and then invert the 
matrix. 


5. (a) Let f € @'(Z2) be given by f = (0,1). Compute 92(f). Similarly, 
compute 93(f) where f = (0,0, 1). 


(b) Characterize all g € ¢'(Z,) such that g*g*---*g (@ factors) is e. in 
terms of 9;(g). 


6. Given f = (ao, a,,42), 9 = (bo, b1, 62) in ¢"(Z3) compute: 


(a) f and §; 
(b) f +9; 

(c) f *9; 
(d) (f *9). 


7. Continuous characters of R. 
Let c: R — T be a continuous mapping such that c(z + y) = c(z) -c(y) 
for all z,y in R. Show that there is an element a € R such that c(z) = 
(e'*)* for all z € R. Hint: Recall that Q is dense in R (Chapter Zero, 
Section 3, Corollary 2 to Theorem 1). 


7. An Application 


There are some interesting, and surprising, connections between the finite 
Fourier transform and tlie theory of polynomial equations. These were found 
by Lagrange in his study of the cubic and quartic. The calculations involved 
are a bit messy, and we shall try to present the underlying ideas first so as not 
to lose sight of them in the technical details. Let us start with the simplest case 
of all, the well-known quadratic. 
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Suppose we are given the equation 
ax? + br +c =0 (1) 


where a # 0 and the roots are 2), rz. The French mathematician Vieta first 
noted the relation between the roots and coefficients. If this case they are 


—b 
ry +22= rk (2) 


c 
¥\i2=— (3) 


As functions of :r}, x2 these are clearly symmetric, meaning they are in- 
variant when we interchange the two variables. It turns out that any symmetric 
function of z;, z2 can be represented in terms of the two functions given in (2) 
and (3); i.e., any symmetric function of the roots can be expressed in terms of 
the coefficients. We shall prove this only for the special cases we need for our 
discussion. As an example, consider A = (x, — 22)*. This is clearly invariant 
when we reverse our variables. We can write A in terms of a, 6, and c as follows: 


A = (x4; —2z2)* = x + a — 27,22 = x + x3 + 27,272 — 42,22 (4) 
= (2) +39)? — 42422 
—b,  c  6?-4ac 
re re 
where we have used (2) and (3). Combining (2) and (4) we have the system 
b 
@y+r2>-- (5) 
a 
— 4ac 
I) - 22 = 
a 


This is easily solved, but note that it is equivalent to: 


b 
1 1 D\ a 
= (6) 
i -1 r2 b? — dac 
a 


The coefficient matrix in (6) is the finite Fourier transform obtained using 
the primitive square root of nnity. We may solve (1) by solving (5) or (6). In 
the latter case 


: (7) 
© l —1 vb — 4ac 


7. An Application 145 


Let us turn now to the cubic 
zr? + pzr*+qr+r=0 (8) 
with roots x}, 22, 73. Lagrange chose, and fixed, a primitive cube root. of unity 
w and considered the form 
A, = (11 + wre + w*z3)° (9) 


He noted that this is not symmetric. Three of the six permutations of 
Z1, Z2, 73 do leave it invariant, but three transform it into 


Ao = (zr; + w*z2 + w23)° (10) 


Similarly, the three permutations that leave (9) invariant also leave (10) 
invariant, whereas the other three transform it into (9). This means that A, +A2 
and A, Ag are invariant under all six permutations and hence can be expressed 
in terms of p, qg, and r; we shall do this explicitly below. It follows then that 
A, and Ag, can be found by solving the equation 


X" - (Ay + Aa)X + (AiA2) = 0 (11) 
and once we have those numbers we may write 
I} +I72+273 = —p 
Q, + wr + w*zr3 = Ay (12) 


z+ ure + wr = v Ao 


or 
1 1 1 I} —p 
1 ow w? a VAL (13) 
1 w* w r3 WA2 


Again we see that the coefficient matrix is the finite Fourier transform. 
As we have seen in Section 6, the system (13) can be solved by inverting the 


matrix. We find that the solutions to (8) are 


I) 1 1 1 —p 


Tt |=3/ 1 U wD VO) 
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We now show how one can express A, and Ag in terms of p, q, and r. 
Recall 21, 22, £3 are roots of 
x? + pr?+qzr+r=0 (8) 


The Vieta formulas in this case are 


41+22+273=-p (15) 
1122 +2123 +2973 = Q (16) 
Z)212%3 = -T (17) 


Observe that (—p)? = (z1 +22+23)? = 22 +234+23+2(2)22+2123+2223) 
and hence, using (16), we have 


x? + 22 + 22 = p* — 2q (18) 
Now if we put 2}, 22, 73 into (8) and add the results we get 
x3 +23 + 23 + p(x? + 22 + 22) +: g(z1 + 22 + 23) + 8r = 0 
and putting (15) and (18) into this equation we find 
ar? + 23 + 23 = —p(p* — 2q) + pq — 3r = —p* + 3pq — 3r (19) 


We are trying to express A, A2 and A, + Ag in terms of p, g, and r. The 
results obtained above enable us to do this for A) A2 rather easily. Let 


6= (xy + w7z2+ w*2r3)(z) + wr + w2z3) 
= x? + x3 + x3 + w(z123 + 212 + F273) + w* (2323 +2122 +2223) (20) 
= (p? — 2q) + (w+ w*)q 


where we have used (18) and (16). Now w is a primitive cube root of unity, 
hence not equal to 1. Thus w? + w+ 1 = 0 (Exercises 6, problem 1). Using this 
in (20), we get 


6 =p? —2q-q=p’ - 3q 
and so 
Ay Ag = (21 + wre + w*z3)°(z) + wre + wz3)° = (p? a 3q)° (21) 


The expression for A; + Ag in terms of p, g, and r is more difficult to 
obtain. We need a preliminary result first. Let 


N= r2rq + 2,23 + x?r3 + 2123 + x323 + 2273 (22) 
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By equations (15) and (16) we may write 
—pq = (x) + Fo + 23)(11 22 + 1173 + F273) 
= r?x2 + r?z3 + 217273 
Se z\22 + 2%,297%3 + T3223 
+ 2)7973 + r\22 f r972 


=A-3r 


Hence 


A = —pq + 3r (23) 


A direct calculation shows that 
(a+b+c)> =a +b +c? + 3(a7b + ab? + a2c + ac? + b*c + bc”) + Babe 
(24) 


Using this we find that 
A, = (4, + wr +: w?z3)% = 23 4+ 23423 
+ 3[z?w2z2 + ©(wz2)? + rew*r3 + r(w?23)? + (wz2)*w*z3 


+ (wz2)(w*zr3)*) + 62,2273 
= 234234 23 + 3w[zr2zq + 2,22 + 2223] + 3w? (x, 23 + 2223 + F273] 


+ 627;2273 
and, similarly, we find that 
Ao = (4, + wre + w23)° = zr + x + x3 + 3w?[r722 + r\23 + 1273] 
+ 3w[z\23 + r723 + L273] + 62,2273 
Adding and again using the fact that w? + w+ 1 = 0 we find that 
A, + Ag = 2x} + 23 + 23) — 3( x2 re + 2,22 + x273 + 2,23 + 2323 + £273) 
+ 127127273 
Finally, using (19), (23), and (17) we get 
A, + Ag = (x1 + wr2 + w*z3)3 + (21 + w? x2 + w73)3 
= 2(—p* + 3pq — 3r) — 3(—pq + 3r) — 12r 
= —2p* + 9pq — 27r 


(25) 


Equation (11) is then 
x — [Ai + Ag]X + Ay Ag = x — [—2p* + 9pq — 27r|X + (p* — 3q)? = 0 
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From this we can solve the cubic as we showed earlier. 


Note. In applying this method we must choose cube roots of A, and Ag. 
We must be careful to do that in such a way that (/A, A> = p? — 3¢. 


To illustrate how this method may be used, we shall solve an equation with 
known roots. We take 


2? — 627 + 1llz-6=0 


whose roots are easily seen to be 1, 2, and 3. Clearly, p = —6, g = 11, and 
r= —6. Thus 


A, A2 = (p? — 3q)° = [36 — 3(11)]* = 27 
and 


A, + Ag = —2p* + 9ng — 27r 
= —2(-6)° + 9(-6)(11) — 27(-6) = 


Hence our quadratic is simply 
X’ 427=0 
and, solving this, we find that 
A, =i 39/7, A, = -i39/? 


We must take the cube root of each of these quantities. Of the three cube 
roots of 7, we select —7. With this choice we find that 


/A, = -iV3 and @/A, = iv3 


Note that /A,A, = = 3 = p* — 3q. The roots of our equation, 
1, £2, 23, are given by 


a 11 1 -p 


wo] — 
— 

Ee 

o~7! 

e 

w 
> 
ue 


t2 


Z3 1 we @ V/Ao 
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which leads to (taking w = >! + i¥3 ) 


2 = 3(-p+ Var + VEx) = 5[-(-6) — v3 + iv9] =2 
gee 3(-p + /B, + wi/Ba) = (6 — wiv3 + wiv] 

= 5(6 + iV8(w — w?)] = 5[6 + (év3)(iv3)] = 5 {6 - 3] =1 
Z3 = 3(-p+ wi + w? */A2) = 5 (6 + w(-iv3) + w?(v3)] 


= (6 + (w? ~ w)iV3] = AG - (iV¥3)(iV3)] = ; (9) = 3 


Exercises 7 
1. Use Lagrange’s method to solve: 
(a) c>-1=0 
(b) 2? — 527+ 8r-4=0 


2. A permutation of a finite set is a one-to-one map from this set onto itself. 
There are six permutations of the set {1, 2, 3}. These may be written as 


1 2 3 1 2 3 1 2 3 
d= : e= f= 

2 1 3 1 2 3 3 1 2 
where we understand that a(1) = 2, a(2) = 3, a(3) = 1, etc. Let us 


call 2, + wx2 + w*z3, y, and let a(y) = Tai) + WXa2) + w?r4(3) = 
Z2 + w7r3 + wr). 


(a) Show that a(y) = wy, and f(y) = wy. 
(b) Show that b(y) = w2c(y), and d(y) = we(y). 


(c) Conclude that y? = (1; + wr2 + w?z3) is unaffected by a, e, and 
f whereas b, c, and d transform it into (2, + w*z2 + wz3)°. 


3. Let A = (2, — r2)(z) — £3)(x2q — 23). Referring to problem 2. take 
a(Q) = [zac1) — Lac2))[Zar) — Fa(3][a(2) — Za(3)] and similarly for the 
other permutations. Which permutations map A to A (these are called 
even) and which map A to —A (these are called odd)? 
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8. Some Algebraic Matters 


We have seen that. for a fixed natural number n, the multiplicative group (Un, -) 
is generated by any primitive nth root of unity. In particular, if w is a primitive 
6th root of unity. then w® = 1, six is the smallest positive integer for which 
this is true, and Ug = {w°,w,w?,w>,w4,w5}. It is easy to see that w?, which is 
a 6th root of unity, is also a cube root of unity. The same is true of w*. The 
element w* is a square root of unity, whereas w° is primitive. 

There is a problem here that we shall solve in this section. A complex 
number z is an nth root of unity if z” = 1. But, for such a number z, we may 
have :* = 1 for k <n. The sinallest positive integer for which this is true was 
called the order of z (Section 6, Definition 5), 0(z), and we have seen that 6(z) 
must be a divisor of n. What we shall do is figure out how to compute the order 
of every nth root. We have already seen how to begin. We choose a primitive 
nth root w and note that because U,, is equal to {wu | 0 < j <n — 1}, every 
nth root of unity is equal to uw for some j = 0,1,2,--- ,n— 1. So all we need 
do is figure out how to compute the order of u”. The problem can be solved by 
recalling a little number theory. 

When a, are integers we say a is a factor, or a divisor, of 6 if there is 
an integer m such that am = 6b. When this is the case we say a divides b (or 
divides evenly into 6), and write a|b. A useful observation is this: 


Lemma 1. Ifa,6,c, and d are integers, if a+b = c, and if d divides any two 
of the numbers a, b,c, then it divides the third one. 


Definition 1. Let a,b be two positive integers. A positive integer d is said to 
be the greatest common divisor of a and b if (i) dla and d|b; (ii) any integer c 
that divides both a and 6 also divides d. 


So, for example, the numbers 18 and 24 have 1, 2, 3, and 6 as common 
divisors. The greatest of these is, of course, 6. 


Lemma 2. Any two positive integers a,b have a unique greatest common 
divisor d. Furthermore, there are integers ro and yo such that d = azo + byo. 


Proof. Let Af(a,b) = {ax + by | x.y are in Z}. Clearly, a = a-1+6-0 
and b = a-0+6-1 are in M(a,b), and because (ax, + by) — (ax2 + by2) = 
a(r; — rz) + 6(y, — y2), this set is an additive group (Section 4, Definition 
1). Hence, by Corollary 2 to Lemma 1 (Section 4), there is a smallest positive 
integer d € M(a,b) and M(a,b) = {kd | k € Z}. It follows that dja and 
d|b. However, because d € M(a,b), there are integers rg and yo such that 
d = aXo + by. From this, and Lemma 1, we see that any common divisor of a 
and 6b must divide d. 

To see that d is unique we simply note that any two greatest common 
divisors of a and 6 must divide each other. Because they are positive integers 
they must then be equal. 
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Definition 2. Given two positive integers a and 6, we denote their greatest 
common divisor (their g.c.d.) by (a,b). We shall say that a and 6 are relatively 
prime when (a,b) = 1. 


From Lemma 2 we see that when (a,b) = 1 there are integers x9 and yo 
such that 1 = azo + byp. The converse is also true. If, for two natural numbers 
a and 0, there are integers rp and yo such that 1 = azo + byo, then, by Lemma 
1, a and b must be relatively prime. This has a useful consequence. Suppose 
€ = (a,b), so € = ax + by for some integers x and y. Then 1 = ($)r + (3)y, 
showing that the natural numbers 7 and b are relatively prime. 


If a,b,c are natural numbers and al|bc we cannot conclude that a|b or that 
alc. For example, 6|24 so 6|8 - 3, but 6 does not divide either 8 or 3. This 
observation gives meaning to our next result. 


Lemma 3. Let a,b,c be natural numbers and suppose that a and 0b are rela- 
tively prime. If albc, then alc. 


Proof. We have 1 = (a,b) and so 1 = az + by for some integers x and y. Thus 
c = (ac)z + (bc)y and hence, by Leinma 1, alc. 


Recall that a prime nuinber is an integer greater than 1 whose only factors 
are itself and 1. If p is a prime and a is a natural number, then either pia or 
(p,a) = 1. Thus: 


Corollary 1. Let a,b be natural numbers, let p be a prime, and suppose that 
plab. Then either pla or p\b. 


We are now ready to compute the order of w*, where w is a primitive nth 
root of unity and 0 < k < n—1. Recall that the order of w* is the smallest 
positive integer d such that (w*)? = 1. We have seen that if for some positive 
integer m, (w*)™ = 1, then d|m. It is easy enough to repeat the argument here. 
Because d < m, m = qd+r for integers q and r with 0 < r < d. But (w*)™ = 1 
and (w*)? = 1 so (w*)" = 1, showing r must be zero. 


Theorem 1. Let w be a primitive nth root of unity. Then the order of w* 
is Than)’ i.e., the integer obtained by dividing n by the g.c.d. of n and k. In 
particular, w* is primitive if, and only if, n and k are relatively prime. 


Proof. Suppose that the order of w* is d and that @ = (k,n). We want to show 
that d = 7. Because these are positive integers we need only show that they 
divide each other. 

Now (w*)"/é = (w”)k/é = (1)*/* = 1 and so, by the observation we made 
immediately above, d|(#). Because (w*)4 = 1, n|kd. It follows that ($)|(4#), 
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which shows (#)|(4)-d. But $ and # are relatively prime. Thus (3)|d and we 
are done. 


The number theory we have discussed here has some further interesting 
applications that we shall now explore. We first note that any additive subgroup 
of Z has an unusual property. Given a subgroup (@) of Z we have (a- 6) € (@) 
whenever b € (@), and this is true for all a € Z. It is this property that enables 
us to introduce a new operation, a kind of multiplication, onto the set Z,. First 
we need: 


Lemma 4. Let (@) be any additive subgroup of Z and suppose that a = a’ 
mod (@), b= 06’ mod (é). Then, a-b=a’-b' mod (@). 


Proof. Because a =a’ mod (@), we have a = a’ + ké for some integer k. Also, 
b = b' + mé for some integer m. Hence a-b = a’ -b! + [(a’m)é+ ('k) e+ (mke)é], 
which proves our result. 


Definition 3. Let ¢ be a fixed natural number greater than 1. For any a,b 
in Ze we define a - b to be (ab); note here that @ denotes the equivalence class, 
modulo f, that contains a. 


It is clear, from our lemma, that the binary operation - is well defined. 
Moreover, because (ab) = (ba) we have @ - b = b-@, because [a(bc)| = [ab)c] we 
have a@- (b-@) = (@- b) -, and because a = a: 1 we have G = 4-1. 

Let us compute the multiplication tables in two cases: 


(Zs, ‘) (Ze, ) 


These tables display some interesting features of this new operation, but before 
we can discuss them we need some additional terminology. To begin with, the 
element 1 is called the multiplicative identity of Zp. An element 6 of Z, is said 
to be invertible if it has a multiplicative inverse; i.e., an element @ € Zs such 
that a-b=b-a=1. 

The table for (Z5,-) shows that every nonzero element of this set has a 
multiplicative inverse. This is not true in Zg. There we see that only I and 5 
have inverses, whereas 2,3,4 do not. Even stranger things happen in (Zg,-:). 
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The table shows that 2-3 = 6 and that 3-4 = 0, while none of the factors is 
zero. We have developed the number theory we need to explain these “weird” 
results. Again we must start with some terminology. 


Definition 4. Two elements a,b of Z; are said to be proper divisors of zero 
ifa 4045, anda-b=0. 


We observe that a proper divisor of zero could not have a multiplicative 
inverse (see Exercises 8, problem 2). 


Theorem 2. The set (Z;,-) contains proper divisors of zero if, and only if, @ 
is a composite integer (i.e., an integer greater than one that is not a prime). 


Proof. Suppose first that 2 is composite, so @ = k-nforl1<k< @andl<n< é. 
Clearly then, k 4 0 4 7% whereas (kn) = 0, showing that Z;, contains proper 
divisions of zero. 

Now suppose that there are proper divisions of zero in Zz. Let us say k,7i 
are nonzero, yet k - 7 is zero. Then kn = mé for some integer m. If @ were a 
prime it would follow that é|k, giving us k = 0, or é|n and implying that 7 = 0. 
Thus @ must be a composite integer. 


Corollary 1. When @ is a prime the set Z, does not contain proper divisors 
of zero. 


Our next result tells us which elements of Z, have, and which do not have, 
multiplication inverses. Its proof also shows the connection between the latter 
elements and the proper divisors of zero. 


Theorem 3. An element k of Z, has a multiplicative inverse if, and only if, 
the integers k and @ are relatively prime. 


Proof. Suppose that (k,@) = 1. Then there are integers z and y such that 
1 = kr + ly. Thus kx = 1 mod (@) showing that Z is the multiplicative inverse 
of k. 

Next we suppose that k and @ are not relatively prime; i.e., (k,@) =n > 1. 
Then k = kin and ¢ = ¢;n where 1 < kj < k and 1 < @, < @. It follows that 
k- 0; = (kn) -@; = k,(né,) = k,@ € (@). Thus k and @, are proper divisors of 
zero. 


Remark 1. The proof of Theorem 3 shows that when k € Z;, k # 0 and k 
is not invertible, then k is a proper divisor of zero. More precisely, there is an 
é, € Z, such that k and @, are proper divisors of zero. 
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Corollary 1. When @ isa prime, every nonzero member of Zs has a multi- 
plicative inverse. Moreover, for @,b in Ze we have a@- b = 0 if, and only if, one 
of these factors is zero. 


Remark 2. Given a natural number m let E(m) = {k € N]k < m and 
(k,m) = 1}. Euler defined a function y = N — N by setting, for each m € N, 
y(m) equal to the number of elements in the set €(m). This is called the Euler 
phi function or, sometimes, the Euler totient function. We shall study it further 
after we recall some facts about prime numbers. 


Exercises 8 
1. Let w be a primitive nth root of unity. Compute the order of each of the 
numbers uw, 7 = 0,1,... ,n — 1 when: 
(a) n = 6; 
(b) n = 12; 


(c) n is a prime number. 


* 2. Suppose that @ and 6 are proper divisors of zero in Ze. Show that neither 
of these elements can have a multiplicative inverse in Ze. 


+3. Suppose that the integers k, a,b and n satisfy ka = kb mod n. If (k,n) = 
1, show that a = 6 mod n. Hint: In Z,, we have ka = kb. 


4. Compute y(1), y(2), y(3). y(4), and y(p) where p is a prime. 


(a) If p is not a prime show that 1-2- --- -(p—1)=0in Zp. 
(b) If p is an odd prime note that (p — 1) is its own inverse in Zp. Con- 
clude that 1-2- --- -(p—2) =1 in Z,. 


(c) Use (a) and (b) to show that p is a prime if, and only if, 1-2- --- - 
(p—1) =-1 mod p. This is called Wilson’s theorem. 


9. Prime Numbers 


Primes have been investigated since ancient times. A great deal is known about 
them, and yet many mysteries remain. To begin our discussion let us restrict 
our attention to the set N = {1,2,3,...} of natural numbers. 

Given n and d in N we shall say that d is a divisor, or a factor, of n, and 
we shall write d|n, if there is an m € N such that dm = n. Observe that any 
factor of d is also a factor of n. 


Definition 1. A natural number n > 1 is said to be a prime if the only 
divisors of n are n and 1. 
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Clearly 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, ... are primes. A number n > 1 
that is not a prime is called a composite number. 


For any n € N let D(n) = {d € N]d|n}, so D(n) is the set of all factors 
of n. When n > 1 the set D(n)\{1} # @, and so it has a smallest member p. 
Observe that p must be a prime; for if 1 < q < p and q|p, then qjn is smaller 
than the smallest divisor of n. Thus every n > 1 has a prime factor. 


Theorem 1. There are infinitely many prime numbers. 


Proof. Suppose that the set of primes is finite and let this set be denoted by 
Pi, P2,--- Pk, & € N. Set 


Q= pr pr---: Pet 


and note that no p; can divide Q; for. if it did, then by Lemma 1 of Section 8, 
it would have to divide 1. But then any prime factor of Q, and there is at least 
one, is not on our list of all primes, an obvious contradiction. 


If n is a composite nuinber, then m = p,q,, where p, is the sinallest prime 
divisor of n. If g; is a prime, then n is the product of two primes. If q) is 
composite, then q,; = p2qg2, where pz is the smallest prime divisor of qg,. Thus 
n = p)p2q2. Continuing in this way we should be able to write n as a product 
of prime numbers. This is, in fact, the case as we shall now prove. 


Theorem 2. Any integer greater than 1 is either a prime or it can be written 
as a product of primes. 


Proof. We argue by contradiction. Let Mf be the set of natural numbers greater 
than 1 that are not priine and cannot be written as a prodnct of primes. Suppose 
M # @. Then there is a smallest element m of M. Clearly m is not a prime 
and so m = ab, where 1 < a < m, 1 <6 < m. But because a < m and b < m, 
neither of these numbers is in A4. Thus they are primes or can be written as 
products of primes. But then the integer m can be written as a product of 
primes, and we have reached a contradiction. 


It seems natural, given n > 1, to ask if there is more than one way to factor 
n into primes. It turns out that apart from permuting the factors, this can be 
done in just one way. 


Theorem 3. Given n > 1 we may find primes p; < p2 < --: < px and natural 
numbers @),@2,°:: , a, such that 


a1 


n= Dp, at 


P2 py 


This can be done in one, and only one, way. 
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Proof. Assume that the theorem is false. Then there is a smallest integer m > 1 
that has two distinct factorizations: m = p}'--- py", where p, < po <--: < 
pe are primes and @),...,@, are natural numbers, and m = gq? -.. ght where 
QQ, < +++ < qe are primes and 0),... ,b¢ are natural numbers. 


It follows that prilq?? +. gh and so pilq” for some 7, 1 <i < @ (Section 8, 
Corollary 1 to Lemma 3). From this we see that p,|q; and p; = q,;. A similar 
argument shows that q; = p; for some j, 1 < 7 < k. Thus 


PSpPHnluHr 
giving us p) = q). 


Now @ < m and so this number (i.c.. a) can be written, as described 
above, in just one way. But 


ue - bi-1,b b 

eae = ps? po? ++» pit =q;' 1 gb? ... gbe 

Pi 
It follows that k = f, po = qo, ..., Pr = Qe, and ay -1 = b — 1, ag = 
bo,...,@, = 6. Thus the factorization of m is unique, and we have reached a 


contradiction. 


For any n € N let us set o(n) equal to the sum of all the divisors of n. We 
observe that 


o(7) =8 <2-7, o(12) = 28 >2-12, o(6) = 12 = 2(6) 


The number r is called deficient when o(n) < 2n, it is called abundant when 
o(n) > 2n, and it is said to be perfect when o(n) = 2n. This terminology goes 
back to the numerologist of antiquity. 

Perfect numbers were discussed by Euclid, and he gave a means of gener- 
ating then. He considered numbers of the form 


n = 2P-'(2P — 1) 
When 2? — 1 is a priine, the factors of n are 
1,2,27,...,2?-', 1-(2?-—1), 2-(2?-~1),..., 2?7!.(2?-1) 
and so 


a(n) =1+24+---42P7!) 4 (2? -1)(1+24+---4+2P7!) 
= (2? — 1) + (2? — 1)(2? — 1) 
= (2? — 1)[1 + (2? — 1)] = 2°(2? — 1) = 2n 


To sum up these observations: Whenever 2?—1 is a prime, the number 2?~!(2?— 
1) is perfect. 
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We turn now to the question of when 2? — 1 is prime. It is easy to see that 
when n is composite so also is 2" — 1 (see Exercises 9, problem 5). Thus we 
need only consider the numbers 


M, = 2? -1 


where p = 2,3,5,7,11,.... These numbers were discussed by the French cleric 
Mersenne and have come to be called Mersenne numbers. Note that Mz = 
27-~1=3, Mz = 22-1=7, Ms = 25-1 = 31, M7 = 2’ — 1 = 127 are 
all primes. Unfortunately, M,, = 2'! — 1 = 2047 = (23)(89). Still the first 
four Mersenne numbers give us four perfect numbers: 2?-!(3) = 6, 23-!(7) = 
28, 25°-'(31) = 496, and 27-1(127) = 8128. 

In 1876 Lucas showed that 


M27 = 170, 141, 183, 460, 469, 231, 731, 687, 303, 715, 884, 105, 727 


is a prime. It is now known that Mj1913, Mi9737, Mo1701, and Mo3209 are 
primes. 

Given a prime p the Mersenne number M, may, or may not, be a prime. It 
is useful then to have some test, especially one that can be easily programmed 
on a computer, to determine when M, is a prime. Such a test was discovered by 
Lucas and put in final form by Lehmer. He showed that M, is a prime if, and 
only if, M,|Sp-1 where the numbers S,, are defined as follows: S; = 4, Sp41 = 
S? —-2, n=1,2,3,.... 

We shall give a proof of the “interesting” half of this result; i.e., that when 
M,|Sp-1, Mp is a prime. A few preliminary results are needed here. 

Lemma 1. Let a = 2+ V3, @ = 2- V3. Then a&@ = 1 and S, =a?” + 


—_9m—l 
a e 


Proof. When m = 1 we have S; = a+4@, which is clearly true. Assume 
that there is a smallest integer k + 1 for which our formula fails. Then, for 
m=1,2,...,k the formula is true. Thus 


Sk+i = S~-2= (a? +a)’ -2 
= (0) 4 at0n" + (0) 2 
=a" +a” 
because a@ = 1. This is a contradiction and the lemma is proved. 
Suppose now that M, divides S,_,. Then M, must divide ae? 4 qe? 


and so this sum is equal to RM, for some integer R. Multiplying by a2?" gives 
us 


a?" = RM,a?”* ~1 (1) 
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and squaring this gives us 
= (RM,a?”" - 1)? =KM,+1 (2) 


for some integer AK. Assume that +], is a composite number. Then M, has 
a prime factor q with g* < M, (see Exercises 9, problem 2). Recalling the 
definition of M, we see that q # 2. 


Define a set X = {a + bV3|a,b € Z,}. Clearly, X contains q? elements, 
among them 0 and 1. For z,y in X we define z- y as follows: 2 = a+ bV/3, y = 
c+dV3, x-y = (ac-+3bd) +(ad-+be) V3, where the terms ac F500) and (ad+ bc) 
are found in Z,. Clearly, r-y € X. For any x € X we define rz! = zr, x? = 2-2, 
etc., and we shall say that zx is invertible if there is a y € X such that z-y = 1. 
Let Y be the set of invertible elements of X and note that Y contains at most 
q? — 1 elements. 


Lemma 2. For each y € Y, y” = 1 for some natural number r < q? — 1. 


Proof. If y € Y, then each of the elements y, y?,y°,... ,y% —! is in Y. There 
are two cases: (i) If the numbers listed are all distinct, then one of them must 
eqnal 1 (because Y contains at most g? — 1 members) and we are done. (ii) If 
yf =y™ for 1 << ™m, then 1 = y™~‘, and again we are done. 


For each y € Y let O(y) be the smallest positive integer such that yO") = 1. 
Then if r is any integer such that y” = 1 we must have O(y)|r (see Section 6, 
the discussion just below Definition 5). 

Now a € X and because q| Mp, a2””" = -1 mod q by (1) and a?” =1 
mod qg by (2). Thus a € Y and O(a)| 2?, and O(a) cannot be less than 2? 
because a?” ' = —1. (Note that because q > 2 we do not have 1 = —1.) Thus 


O(a) = 2? <q? -1< M,-1=2?-2 


and we have reached a contradiction. 


Exercises 9 
1. If d|n and r}d, show that r|n. 


* 2. If n is a composite number and p is the smallest prime factor of n, show 
that p? <n. 


3. Factor the following numbers into primes: 


(a) 1001 
(b) 756 
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(c) At some point in his life, Euler became blind. It was after that that 
he factored 1,000,009 into (293) (3413). 


4. For any n > 1 consider > 4 where the sum is over all divisors d of n. 
Show that this sum is less than 2 when n is deficient, greater than 2 when 
n is abundant, and equal to 2 when n is perfect. Hint: Compute > 5. 


*5. When n is composite, show that 2” — 1 is composite. 


6. Euler showed that every even perfect number is the form 2?—'M,, where 
M, is prime. At the time of this writing, it is not known if odd perfect 
numbers exist. 


10. Euler’s Phi Function 


Given m € N the number of invertible elements of Z,, coincides with the 
number of integers k such that 1 < k < m and (k,m) = 1. This is also the 
number of primitive mth roots of unity. As we have seen (Section 8, Remark 
2) the function y(m) gives us this number. Our purpose here is to derive a 
formula for computing y(m) from m. 


Lemma 1. Fix 6 € N and consider the set Z,. It consists of the equivalence 
classes 0,1,... ,b— 1. Suppose that r > 0 is an integer and a € N satisfies 
(a,b) = 1. Then 7,(a+1r),(2a+1r),... ,((b—1)a +r) are all the elements of 
Zo. 


Proof. The list given contains b elements of Z,. It suffices only to show that 
these elements are distinct. Now if sa+r =ta+r mod b, where 0 < s,t < b—1, 
then sa = ta mod b. However, because (a,b) = 1 this tells us that s = ¢ mod b 
(Exercises 8, problem 3), and this in turn, because 0 < s,t < b— 1 gives us 
s=t. 


To illustrate the power of Lemma 1, we use it to prove a result that has 
found application in securing computer files. 


Theorem 1 (Fermat). Let a,p be natural numbers and suppose that p is a 
prime. If p does not divide a, then a?~! = 1 mod p. 


Proof. Because p does not divide a it cannot divide any of the numbers 
a, 2a, 3a,... ,(p — 1)a. By Lemma 1, with r = 0, the set G, 2a, 3a,... ,p— la 
and 0, coincides with Z,. It follows that @,2a,... ,(p— 1)a coincides with the 
set 1,...,p—1. Thus, in Z,, we have 


OTT E pl =T2p=1 
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or, equivalently, by 
a?-!.1.2.--(p—1) =1-2:--(p—1) modp 
But (1-2---(p—1), p) =1 and so by Exercises 8, problem 3, we must have 
a’-'=1 mod p. 
Corollary 1. If p is a prime and a is a natural number, then a? =a mod p. 
Let us return now to our discussion of the phi function. 
Theorem 2. Ifa,bare natural numbers and (a, b) = 1, then y(ab) = y(a)y(b). 


Proof. Consider an integer k, 0 < k < ab. We may write k = aq +r, where 
0 < r < a—1. But note that we also have 0 < gq < 6-1. In fact, each 
k, 0< k < abcan be written in one, and only one, way as k = aqg+r with q 
and r as given above. 

Now given k, 0 < k < ab we ask if (k,a) = 1. This may be written 
(aq +r. a) = 1. and this is true if, and only if, (r,@) = 1. There are ¢(a) 
numbers r that are sinaller than a and prime to a. Choose and fix one of them, 
say 1}. 

By Lemma 1 the list r},a@+1r),2a+r).... ,(b—1)a+r), contains b numbers 
distinct mod 6. Hence this list contains exactly y(b) numbers that are prime 
to b. 

Thus each of the y(a) numbers r gives us y(b) numbers that are prime to 
both a and 6. But the number of integers that are prime to both a and 6 is 
y(ab) and, as we have just seen, there are (a)y(b) such numbers. 


Corollary 1. If pis a prime and n is a natural number, then y(p") = p"(1 —- 


p) 


Proof. The natural numbers that are < p” and not prime to p” are p, 2p, 3p, ... 
p"-"p. There are then, p”~! such numbers. Hence the number of natural num- 
bers that are < p" and prime to p” must be p” — p"—! = p"(1 - 5): 


We have seen that any natnral nnmber m > 1 is either a prime or it can 
be written as a product of primes in “essentially” only one way. When m is a 
prime »(m) = m — 1. When m = p}' --- p;* we have: 
Corollary 2. v(m) = y(pt' ...py*) = m(1 — 5-)--- (1 - 52). 
Proof. It is clear that (pf, p;’) = 1 when i # j. Thus 


(py! -:: Py") = 9(py')--- o(D,*) 
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By Corollary 1 we know that (p;’) = p;’(1 - >~). Hence we see that 


3 


l l l 
m) =p (1 —- — )- pt? (1 —- — vit (1- =) 
y(m) vi ( ~) py ( ~) P; De 
l l 
— nt! .nf2... ek Le es 1-=) 
ee a re ( x) ( Pk 
l 


When p, a are natural numbers and p is a prime, the assertion that p does 
not divide a means (p,a) = 1. Fermat showed that we then have a?-! = 1 
mod p. Recall that y(p) = p — 1 so we have a®'?) = 1 mod p. Euler showed 
that for any natural number m if (m,a) = 1, then a¥(™) = 1 mod m. This 
generalization of Fermat’s result also plays a role in securing computerized 
information. We shall discuss this application further after we prove Euler’s 
result. 


Theorem 3. Let a,m be natural numbers and suppose that (a,m) = 1. Then 
a¥(™) =] mod m. 


Proof. Let the n = y(m) natural numbers that are < m and prime to m be 
denoted by a), @2,... ,@n. Note that (aa;,m) = 1 for j = 1,2,... ,n. Further- 
more, if aaj = aa; mod m, then aj = a; mod m by Exercises 8, problem 3. 
Because 1 < aj,a; < m, this means a; = a;. Thus the n elements of Zyn, 
@a),@Q@2,... ,@a,, are distinct and each of them is invertible in Z,,,. Hence they 
must coincide with @;,@3,... ,@,. But then, in Z,,, 


(Gay) - (@az)--- (Gan) = @) -G2--- An 
or, equivalently, 


a” -@) -Q2°*:Qn = Q,Q2°-:a, mod m. 


Finally, because (a) ---a@,, m) = 1 we may again use problem 3 of Exercises 
8 to conclude that a” = 1 mod m. 


Let us finish up with a brief discussion of how the theorems of Ferinat 
and Euler are used in connection with the problem of coding and decoding 
confidential messages. We shall assume that our message must first be converted 
into an integer M. We must first encode M to obtain the encoded message E. 
This is then sent to our correspondent who must, somehow, derive M froin E. 

As a first example, suppose we choose a prime r and a natural number s 
such that (s,7 — 1) = 1. Given M we encode it by setting E = M?. 
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The problem now is to obtain M from E. So we want to find t so that 
E‘ = M*' = M mod r. Suppose we can find t so that st = k(r — 1) +1 for 
some integer k. Then 


E* = (M*)! ae M*t = MA(r-1)41 oe (ur-1)* M 


Because r is a prime, Fermat's theorem tells us that M"-' = 1 mod r and so 
E*=M mod r as reqgnired. 

To find t so that st = k(r—1)+1, we use Euler’s theorem. The equation says 
that st = 1 mod (r—1). Now s¥"—") = 1 mod (r—1) provided (s,r—1) = 1, 
and we chose s so that this is true. Hence all we need to do is take t = s?("-)-!, 
With this choice st = s?"—") = 1 mod (r — 1) as required. 

As an example, let us take r = 17 and s = 3. Then (3, 17 -— 1) = (3,16) = 1 
as required. Given M we find E = M® and send this to our correspondent. 
To decode he must have ¢ such that st = 1 mod (r — 1). We have seen that 
t = s¥("-N)-1, Now r = 17 sor — 1 = 16 and so ¢(r — 1) = (16) = 8. Thus 
t = s’ = 3’ = 2187. We may reduce this by noting that 2187 = 11 mod 16. 
Thus to find M we simply compute E"! mod 17. 

A variant on this technique concerns “public access” codes. Here the r is 
published openly. It is a many digit number that those using the code know is 
the product of two very large primes p and q. One chooses s so that (s, 9(r)) = 1 
and encodes the message, A/, as before. So E = AM]? is sent. 

We now seek ¢t so that E' = M*' = M modr. Again we want st = 1 
mod y(r) or st = k[y(r)]| + 1 for some k. So we take t = s¥(¥(-))) Then 
st = s?(¥(r)) = 1 mod y(r) by Euler’s theorem. 

Finally, Et = M* = M*e(r)+! — [Agelr))k Af = M mod r because M¥(") = 
1 mod r. 

Note that because the receiver knows that r = pq. s/he also knows y(r) = 


(p— 1)(q—- 1). 
Exercises 10 


1. Prove Corollary 1 to Fermat’s theorem. 


2. Compute y(m) for the following numbers: 
(a) m = (64)(81)(625); 
(b) m = 21, 
(c) m = 7? -19-37?. 


3. If pis a prime, show that ¢(1) + y(p) + o(p*) + - + p(p") = p”. 


4. There is a conjecture, unproven at the time of this writing, that n is a 
prime if, and only if, (rn) | (n — 1). 
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Notes 


1. The axiom of choice, and its many equivalent forms, is discussed in “Gen- 
eral Topology” by John L. Kelley (D. Van Nostrand Company, Inc., 1955). 
See pp. 32-35. 


A proof of statement (*) used in the discussion of Weiner’s theorem (Sec- 
tion 2) can be found in “Introduction to Topology and Modern Analysis” 
by George F. Simmons (McGraw-Hill Book Company, New York, 1963). 
See p. 321. See also pp. 308-310 for a further discussion of related ideas. 


Another proof of Wiener’s theorem is given in “An Introduction to Har- 
monic Analysis” by Yitzhak Katznelson (John Wiley & Sons, Inc., New 
York, 1968). This is done on p. 202 where (*) is also proved. 


2. The Fourier transform on R, using Lebesque integration, is treated in 
Katznelson’s book (Note 1 above) and in the book “Fourier Transforms” 
by R. R. Goldberg (Cambridge University Press, New York, 1961). Gold- 
berg, in an appendix, discusses the Fourier transform on a locally compact 
abelian group, and Katznelson has a chapter on this topic. A more ex- 
tensive discussion can be found in “Abstract Harmonic Analysis” by E. 
Hewitt and K. A. Ross (Springer-Verlag, New York, 1963). 


3. Lagrange’s work on the theory of equations is discussed in detail by Harold 
M. Edwards in his book “Galois Theory” (Springer, New York, 1984). 
See p. 2 and pp. 18-22. Edwards also discusses some related work of 
Vandermonde and, on p. 19, gives a brief (and to me at least. surprising) 
biography of Lagrange: “Unlike Vandermonde, who was French but did 
not have a French name, Lagrange had a French name but was not French. 
He was Italian (he was born with the name Lagrangia and his native city 
was Turin) and at the time of the publication of his Réflexions he was a 
member of Frederick the Great’s Academy in Berlin. When he left Berlin 
in 1787 he went to Paris, where he spent the rest of his life and where, 
of course, he was a leading member of the scientific community; this has 
tended to reinforce the impression that he was French. He was certainly 
the greatest mathematician of the generation between Euler and Gauss, 
and, indeed, has a secure place among the great mathematicians of all 
time.” 


4, Our definition of the set X and its algebra leaves something to be desired 
(see Chapter Five, Exercises 3, problem 2). The interested reader can also 
consult the article by J. W. Bruce, “A Really Trivial Proof of the Lucas- 
Lehmer Test” in The Amertcan Mathematical Monthly, vol. 100, number 
4, April 1993, pp. 370-371. 


Mersenne numbers are useful in digital signal processing, and certain finite 
Fourier transforms are sometimes called Mersenne number transforins (see 
2 in the Notes to Chapter 5). 
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5. The Euler Phi function is discussed in “Introduction to the Theory of 
Numbers” by Leonard Eugene Dickson (Dover Publication, Inc., New 
York, 1957), see pp. 7 and 8, and in “Number Theory in Science and Com- 
munication” by M. R. Schroeder (Springer-Verlag, New York, 1986) on 
pp. 113-116. Schroeder also discusses using the phi function for “Public- 
Key Encryption” on pp. 118-126. 

An application of Euler's function to signal processing is given in a paper 
by N. K. Bose and Kim in the JEEE Trans. Signal Processing, vol. 39, 
no. 10, Oct. 1991, pp. 2167-2173. 


Chapter Five 


Abstract Algebra 


Many of the topics discussed earlier in the text can be treated more systemati- 
cally by bringing in some ideas from modern algebra. To do this, we must raise 
the level of abstraction a little bit. The gain in insight, however, is well worth 
the price we must pay to achieve it. In this last chapter, we very briefly dis- 
cuss groups, rings, algebras, and fields and indicate how they arise in harmonic 
analysis. 


1. Groups 


Groups are found in many areas of mathematics—we have seen lots of them 
already—and also in physics and certain parts of chemistry. One might expect 
that a concept with snch a wide range of applicability would be defined ab- 
stractly. This is the case, and it takes some getting used to. To begin with, 
there is only one binary operation (see below) in a general group. This can be 
something very familiar like addition or multiplication or it can be something 
more exotic like convolution or function composition. The examples below make 
this clear. 

When we say that a nonempty set G is closed under a single-valued binary 
operation, we mean that we have a function y : G x G — G. It is convenient, 
and customary, to denote y(g,h) by goh (read “g circle h”) for each pair (g, h) 
€ G x G and to say that G is closed under the binary operation 0; we shall 
always assume that 0 is single valued. 


Definition 1. Let G be a nonempty set that is closed under a binary oper- 
ation o. We shall say that the pair (G,o) is a group if (a) for all g,h,k in G 
we have go (hok) = (goh)ok; (b) there is an element e in G such that 
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goe =eog = g for every g € G; (c) for each g € G there is an element h € G 
such that goh =hog=e. 


We call (a) the associative law and say that the operation o is associative. 
The element ¢, whose existence is postulated in (b), is called the identity of G. 
The element h discussed in (c) is called the inverse of the given element g € G 
and is usually denoted by g—!; we shall see that, given g, the inverse element 
is uniquely determined. 

The pair (C,+) is clearly a group because we know that addition of complex 
numbers is an associative operation, that C has an identity (the number 0), and 
that each z € C has an inverse (the number (—z)). 

The pair (C, -) is not a group. It is true that multiplication of complex nunn- 
bers is associative, and the number | is the identity for this operation. However, 
the number zero, which certainly belongs to C. does not have a multiplicative 
inverse. It is easily seen, however, that (C\{0},-) is a group. 

Let us show how (a). (b), and (c) of Definition 1 imply that the identity 
and inverses are unique. 


Lemma 1. In any group (G, 0) there is only one identity element e. Further- 
ore, given g € G there is only one element h € G such that goh = hog=e. 


Proof. We are assuming that go e = e€0g = g for all g € G. Suppose that e’ 
is an other element of G for which go e’ = e' og = g for all g € G. Then, in 
particular, e = eoe’ = e’ oe and e’ = e’0e = ee’. Because € oe’ = €' 0€ we 
see that e = e’. 

Now let g € G and suppose we have two elements h, k in G such that goh = 
hog =e and gok = kog =e. Then h = hoe = ho(gok} = (hogjok =eok =k 
and we are done. 


Corollary 1. For any g € G, (g-')"' = g. 


l 1 1 


Proof. Given g € G there is an element g~* such that g 0 g7 
But this says g is an inverse of g~'. Now the inverse of g7! 
(g-!)~". Thus, by Lemma 1, g = (g~')". 


=g'og=e. 
is, by definition, 


Definition 2. Let (G,o) be a group and let H be a nonempty subset of G. If 
(H,°) is a group we shall say that H is a subgroup of G. 


If H is a subgroup of (G,o), then because H # © there is an element 
h € H. But because (H,0) is a group, h~' must belong to H. Furthermore, 
because H is closed under the operation 0, hoh~! =e is in H. 


Lemma 2. Let (G.0o) be a group and let H be a nonempty subset of G. Then 
H is a subgroup of G if, and only if, goh-' € H whenever g and A are in H. 
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Proof. Our discussion just above this lemma shows that our condition is nec- 
essary. Suppose now that H satisfies this condition. Then because H # @ we 
can choose h € H and note that e = hoh-! € H. Thus (H,0°) satisfies (a) 
and (b) of Definition 1; we have just proved (b), and (a) follows froin the fact 
that (G,o) is a group. To prove (c) choose any h € H and note that because 
e € H, h~'! =eoh"' is in H. Finally, we must show that H is closed under 
o; i.e., we must show that goh € H whenever g,h € H. So let us suppose 
g.h € H are given. Then h~! € H as we have seen. Hence go (h-!)~" eH 
and, by Corollary 1, this is just go h. 


We have seen that (C, +) is a group. By Lemma 2 then a nonenipty subset 
A of C is a subgroup of C if (2a — b) € A whenever a,b are in A. Referring 
to Chapter Four, Section 4, Definition 1, we see that what we have called an 
additive group is simply a subgroup of (C, +). 

Similarly, because (C\{0},-) is a group, a nonempty subset M of (C\{0}) 
is a subgroup if z-y~' € M whenever z and y are in M. Referring to Definition 
2 of Chapter Four. Section 4, we see that what we have called a multiplicative 
group is simply a subgroup of (C\{0}, -). 


Definition 3. A group (G,°) is said to be an abelian group, or a commutative 
group, if the binary operation o satisfies the commutative law; i.e., if goh = hog 
for all gh EG. 


Because addition and multiplication of complex numbers are commutative 
operations, all subgroups of (C, +) and of (C\ {o},-) are abelian groups. 


Recall that given an additive group A and a subgroup C of A we said that 
elements a,b of A were congruent modulo C’, and we wrote a = bmodC, when 
(a — b) € C (Chapter Four, Section 5, Definition 1). We showed that this gives 
us an equivalence relation on A, and we denoted the set of equivalence classes 
by A/C. In the case of a multiplicative group M with subgroup N, we defined 
$ =tmodN to mean s-t~' € N (Chapter Four, Exercises 5, problem 1). In 
this case too we get an equivalence relation on M, and we denoted the set of 
equivalence classes by M/N. Not surprisingly, we can do this for any group G 
and any of its subgroups H. We sketch the process here and leave the details, 
which are almost identical to those in the special cases mentioned above, to the 
reader. 


Given a group (G,o) and a subgroup H of G we define for g, hE G. g= 
hmod H to mean that goh™! € H. We say g is congruent to h modulo H. 


Lemma 3. Congruence modulo H is an equivalence relation on G. 


The set of equivalence classes of G defined by the relation congruence mod- 
ulo H is denoted by G/H. We have a natural mapping yy :G — G/H defined 


168 Chapter Five s Abstract Algebra 


by setting 1; (g) equal to the equivalence class of G (i.e., the element of G/H) 
that contains g, for each g € G. Calling this equivalence class [g] we have 
yu (9) = [9] for each g € G. 


Definition 4. Let (G,o) be an abelian group and let H be a subgroup of g. 
For each [g] ,[h] in G/H define [g] * [h] to be [g 0 A]. 


Lemma 4. Referring to Definition 4 we claim that (G/H,*) is an abelian 
group. 


Proof. We inust first show that * is well defined. So suppose we are given [g] , [h] 
in G/H and we choose two elements g’, g in [g] and two elements h’, h in [h]. We 
must show that g’ oh’ = gohmod H, for then we have [g’ 0 h’] = [go h]. Now 
g/0g7' and h’oh~" are in H and so their “product,” (g! 0 g~')o(h’ oh~!), isin 
H. But (9! 0g7')o(h’oh~') =g/o[g7! 0 (h’oh')] =g'0[(g-! oh’) oh], 
and because (G,0) is abelian g~'oh’ = h’og~'. Hence (g! og™')o(h'oh~') = 
go [(h’og-') oh-"] = g'o [h’'0 (g-! 0h-")| = (g' oh’) 0 (go h)~' because 
(goh)7'=h-!0g7! =g"' oh"! in this case. 

The fact that * is associative and commutative follows from the fact that 
o has these properties. More explicitly, 


[g] + ({h] + [A]) = [g] * [ho k] = [go (hok)] = [(goh) ok] = [g oh] [hk] = 
([g] *[h]) * [A] and [9] + [h] = [go A] = [hog] = [A] + [g] for all g,h,k in G. 


The identity element of G/H is just [e] because [g] * [e] = [go e] = [g] and 
[e] * [g] = [e o g| = [g]. Finally, for any 


[h] € G/H, [h] « [A7'] = [hoh7"] = [e] = [h7! oA] = [AW] « [A 
and so [h]~! = [Ao]. 


Corollary 1. For any fixed n € N, (Z,,+) is an abelian group (Chapter 
Four, Section 5). 


There are many other familiar mathematical objects that are groups. Any 
vector space, for instance, is an abelian group under addition (Appendix A). In 
particular then, the set of all 2 x 2 matrices with real entries is an abelian group 
under addition. The nonsingular 2 2 matrices are a group under multiplication, 
but this one is nonabelian. Another example of a nonabelian group that we have 
considered is the set of permutations on {1,2,3} (Chapter Four, Exercises 7, 
problem 2). Here the binary operation is function composition. and because 


aob= (123) =c, whereas boa = (3 7 3) =d, we see that aob # boa. 
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Exercises 1 


1. Let (G,o) be a group and let H be a subgroup of G. Show that congruence 
modulo H is an equivalence relation on G. Do it in steps: (a) show that 
for any g € G.g = ginod H;; (b) if gh € G and g = hmod H., show that 
h = gmod H; (c) if g,h, 4 are in G and g =h, h =k, show that g = k. 


2. Consider the set (Z,,,-) for n fixed. Let E,, be the set of all elements of 
this set that have inverses. Show that (£,,-) is an abelian group. 


3. Let (G,o) be an abelian group and let H be a subgronp of G. For each 
g €G let pa (g) = [gy] € G/H. So yy : G - G/H. 


(a) Show that yy (goh) = gy (g) * pH (A) for all g,h in G. 
(b) What is {9 € G| yu (9) = lel}? 


2. Morphisms 


We have had occasion to consider mappings c : Z — T such that c(m +n) = 
c(m)-c(n) for all m,n € Z, and also mappings c : Z, — T that have this 
property (Chapter Four, Section 6, Definition 2 and Definition 3). Mappings of 
this kind play a fundamental role in the general theory of groups. 


Definition 1. Let (G,o) and (G’,+*) be two groups. A function y : G — G’ 
is said to be a homomorphism from G into G’ if y(g oh) = o(g) « y(h) for all 
gh in G. 


In Chapter Four, Section 6 we defined certain maps that we called the char- 
acters of Z and of Z,,. We see now that these are simply the homomorphisms 
fron (Z,+) and (Z,, +), respectively, into (7, -). 

The basic properties of any homomorphism are easily proved. 


Lemma 1. Let (G,0o) and (G’,*) be two groups and let y : G — G’ be a 
homomorphism. Then (a) y(e) = e’ here e is the identity element of G and e’ 
is the identity of G’; (b) for every g € G, o(g7') = (g)7!: (c) the set Ker 
y = {g € G| y(g) = e’} is a subgroup of G. 


Proof. For any g € G we have y(g) = y(goe) = v(g) + p(e). Multiplying 
on the left by y(g)~! we see that e’ = y(e) as claimed. This proves (a). Now 
let g € G be given. We have g~! 0g = gog™! =e and so y(g~') * v(g) = 
v(g) *~(g~') = e’ by (a). It now follows from Lemma 1 of the last section that 
o(g~') = v(g)7!. This proves (b). 

The set Ker y is nonempty because, by (a). it contains e. To show that 
it is a subgroup we need only show that when g.A are in this set so also is 
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g°h7* (Section 1, Lemma 2). But Ker y(g) = {9 € G| y(g) = €’}, so all we 
must do is show that y(goh—!) =e’ given that y(g) = e’ and y(h) = e” Now 
y(goh-*) = (g) # y(h-") = y(g) *[p(A)]~* =e’ *(e’)”' =e’, here we have 
used (b). 


Definition 2. Given a homomorphism y : G — G’, the set Ker y = {g € 
G| y(g) = e’}, where e’ is the identity element of G’, is called the kernel of yp. 


We have just shown that the kernel of any homomorphism on G is a sub- 
group of G. 


Corollary 1. For g,h € G we have y(g) = (h) if, and only if, goh™' € 
Ker yp. 


Proof. Suppose that y(g) = y(h) then y(g) * y(h)~! = e’, but because ¢p is 
a homomorphism, this can be written as y(g) #y(h)' = y(g) #9 (h7) = 
y(goh-"), and because this is equal to e’, goh-' € Ker . 

Now suppose that go h™? is in the kernel of y, then e’ = y(goh7') = 
y(g) * 9(h—") = v(g) * p(h)—!. Multiplying on the right by y(h) we see that 
y(h) = 9(9). 


Corollary 2. The homomorphism y is one-to-one if, and only if, Ker y = {e}. 


Proof. Because we always have y(e) = e”, assuming that ¢ is one-to-one 
immediately implies Ker y = {e}. Conversely, if we assume Ker y = {e}, then 
(9) = ¥7(h) tells us that go h~! =e and hence that g = h. 


Definition 3. A homomorphism between two groups is called a monomor- 
phism if it is one-to-one . A monomorphism that is also onto is called an isomor- 
phism. Finally, two groups are said to be isomorphic if there is an isomorphism 
froin one of these groups onto the other. 


The set R is a group under addition and the set R,, the set of positive real 
numbers, is a group under multiplication. Note that f(z) = e” is an isomor- 
phism from the group (R,+) onto the group (R,,:) and that g(z) = Inz is 
an isomorphism from (R,,-) onto (R, +). Long before the advent of the com- 
puter the fact that the log function is an isomorphism was exploited to simplify 
lengthy calculations. In these applications the base 10, the so-called common 
logarithms, was used. 


Let (G, 0) be a group, and let K be a subgroup of G. Two elements g, h of G 
are congruent modulo K if goh™? is in K, and this is an equivalence relation on 
G (Section 1, Lemma 3). We want to characterize the equivalence class defined 
by an element g € G; i.e., the set {h € G| g = hmod K} that we denoted by 
[g]. Note that A € [9] if, and only if, hog~! € K. Thushog”'=k€e K, and 
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so h = kog. This shows that [g] C {kog|k € K}; let’s call this last set Kg. 
If h € Kg, then h = kog for some k € K and hence hog"! =ke K; ie., 
Kg C |g]. So we have, for any g € G, [g] = Kg. We will use this very soon. 
Note that (g] « [h] = (Kg) * (Kh) = K (goh). 


Theorem 1. Let (G,o) and (G’, *) be two abelian groups and let y : G — G’ 
be a homomorphism from G onto G’. Let K = Kery. Then the groups (G/K, *) 
and (G’,*) are isomorphic. Furthermore, the map y~ : G/K — G’ defined by 
wv (Kg) = ¢(g) is an isomorphism between these two groups. 


Proof. We first show that y is well defined. Suppose that Kg = Kg’ for g,g’ 
in G, we must show that #(Kg) = 9(g) = 9(9') = w(Kg’). But because 
Kg = Kg’ ande € K (e is the identity of G), we see that g = kog’ for some 
k € K. Hence v(g) = p(k og’) = p(k) *p(9') =e * o(9’) = (g’), where e’ is 
the identity of G’, and we have used the fact that k € K = Ker ¢. Thus y is 
well defined. 

Given g’ € G’ there is a g € G such that y(g) = g’, because y is onto. But 
then w (Kg) =’ showing that a is onto. 

Suppose that ~ (Kg) = ¥(Kg’). Then, as we saw in the first paragraph of 
this proof, y(g) = y(g’) and so y(g)*y(g’)”' =e’. This says go(g’)”' € Ker 
y = K. Hence Kg = Kg’, and we have shown that yw is one-to-one. 

Finally, »((Kg) *(Kg’)] = ¥[K (9°9')] = o(gog') = v(g9) * (9') = 
wb (Kg) *¥(Kg’). 


Corollary 1. For any fixed n € N, the groups (Z,,, +) and (U,, -) are isomor- 
phic. 


Proof. Let w be a primitive nth root of unity. Define y : Z — U,, by setting 
y(k) = w* for every k € Z. Clearly, y is a homomorphism because 9 (k + 1) = 
git = wk = y(k)-y/(l). It is onto because w is primitive. Note that Ker yp = 
{kE Z| p(k) =1} = {kEZ|w* =1} = {ke Z|k=nl for some! € Z} = 
{nl |l€ Z} =(n). Thus Z/(n), which is (Z,,, +), is isomorphic to (U,,-). 


Remark 1. Theorem 1 is also true for nonabelian groups. This is because 
the kernel of a homomorphism is a special kind of subgroup, called a normal 
subgroup. We shall not discuss these matters any further (but see Exercises 2, 
problem 4) except to say that when G is nonabelian, G/H need not be a group. 


Exercises 2 


1. Let (G,o) and (G*, *) be two groups, and let y : G — G* be a homomor- 
phism. Show that Im y = {¢(g) | g € G} is a subgroup of (G*, *). 


172 Chapter Five ow Abstract Algebra 


2. Let (G.o) be a group, and let y : G — G be an isomorphism. Because 
aps G onto itself, we call y an automorphism. 


(a) Show that H = {g € G| y(g) = g} is a subgroup of G. 

(b) Define y : (C, +) — (C, +) by setting y(z) = z (the complex conju- 
gate of =) for each z € C. Show that this is an automorphism and 
identify {z € C | p(z) = z}. 


3. Show that f (x) = e'? is a monomorphism from (R,+) into (C\ {o},-) 
and identify Im(/). 


4. Given a group (G.o) we may define, for each fixed g € G, yg(h) = 
g 'ohog for all heG. 


(a) Show that y, : G — G is an automorphism (problem 2 above). 


(b) A snbgroup H of G is said to be a normal subgroup if y, (H) = 
{yy (h) | 4 € H} C A for every fixed g € G. Show that the kernel of 
any homomorphism on G is a norinal subgroup. 


(c) If (G.o) is abelian, show that every subgroup of G is a normal sub- 
group. 


3. Rings 


The subgroups of the group (Z,+) are of the form (l) = {kl | ke Z} where 
1 =0,1,2,... is fixed. These have the unusual property that n-a € (l) whenever 
a € (l) and n € Z. This is what enabled us to define multiplication in Z, 
(Chapter Four. Section 8, Definition 3, Lemma 4 and the discussion just before 
Lemma 4). We can get a better understanding of what is happening here by 
introducing another abstract algebraic structure. 


Definition 1. A ring (/2,+.-) is a nonempty set R that is closed under two 
associative binary operations + and - for which (a) (R, +) is an abelian group; 
(b) given a,b,c in R, we have both a-(6b+c) = a-b+a-c and (a+b)-c = a-c+b-c. 
If we also have (c) a:b = b-a for all a,b in R, then we may say that the ring 
(R,+.-) is commutative. 


Given a ring (R,+,-) the identity element of (R,+) is denoted by 0 and 
the inverse of any a € R is denoted by (—a). When R has had a multiplicative 
identity, we shall denote it by 1 and say that (R,+,-) is a ring with unit. To 
avoid triviality, we shall always assume, given a ring with unit, that 0 # 1. 

It is clear that (Z,+.-) is a commutative ring with unit. The set of all 2 x 2 
matrices with real entries is, under matrix addition and matrix multiplication, 
a noncomniutative ring that has a unit. Let us show that the laws of signs, that 
we all memorized in high school, are consequences of the axioms for a ring. 
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Lemma 1. In any ring (/2,+,-) we have (i) a-0 = 0-a = 0 for every a € R; 
(ii) a- (—b) = —(a- b) = (—a) -b for all a.b in R; (iii) (—a) - (—b) = a- 6 for all 
a,bin R. 


Proof. For any a,b in R we have a-b =a-(b+0) =a-b+a-0. Thus a-0=0. 
Similarly, b-a = (b+ 0)-a = b-a+0-a and so 0-a = 0. Now that (i) has been 
proved we may use it to prove (ii) and (iii). Becanse 0 = a-0 = a- (b+ (—b)] = 
a-b+a-(—b) we see that a-(—b) is the inverse, in (R, +), of a- b. But there is 
only one inverse of a-b, — (a- 6) (Section 1, Lemma 1). Thus a-(—6) = — (a- 6). 
The proof that (—a) -b = —(a- b) is similar. 

Finally, 0 = (—a) -0 = (—a) - [b + (—b)] = (-a) -b + (—a) - (—6b). Using (ii) 
we may rewrite this as 0 = —(a-b) + (—a) - (—6). But this clearly shows that 
(-—a) - (—b) is the inverse in (R,+) of —(a-b) and since that inverse is a - b, 
(-—a)-(—b) =a-b. 


In the study of a given ring, certain of its subrings are of special importance. 
We define these now. 


Definition 2. Let (R,+,-) be a ring. A nonempty subset / of RF is called an 
ideal in R if (a) J is a subgroup of (R.+); (b) for any a € J and anyre R 
both a-r and r-a are in J. An ideal J is called a proper ideal if J 4 R. 


Remark 1. We have seen that (1'(Z) , +, *) is a commutative ring with nnit 
(Chapter Four. Section 1, Definition 1, Lemma 3 and Exercises 1, problem 2). 
When we defined the ideals in this ring, however, we were looking at it as an 
“algebra over C” (Chapter Four, Section 2, Definition 2). A ring (R,+.-) is 
called an “algebra over C” if (a) (AR, +) is a vector space over C (Appendix A): 
(b) for each a,b in R and each A € C, A(a- 6) = (Aa) - 6b = a: (AB). Ina ring 
(R,+,-) we want onr ideals to be subgroups of (RA, +); in an algebra (Rf, +, -) 
over C we want our ideals to be linear subspaces of the vector space (R, +). 
The proof that (!'(Z),+,*) is an algebra over C was given in (Chapter Three, 
Section 1, example (iv) and Chapter Four, Exercises 1, problem 2(b)). 


Let us focus now on a fixed commutative ring with unit, (A,+,-), and a 
proper ideal J in R. Because / is a subgroup of the abelian group (AR. +), defining 
a = bmod/ to mean (a — 6) € J gives us an equivalence relation on R (Section 
1, Lemma 3). Also, the eqnivalence classes are the sets a+ J = {a+6|be I} 
(Section 2, the discussion just above Theorem 1). We know that R/J is an 
abelian group when we define (a + J) + (b+ J) to be (a + 6) +J. We can define 
a multiplication in R/J. What we do is set (a + J) * (b+ J) eqnal toa-b+ TJ. 
We shall now prove: 


Theorem 1. (f//.+,*) is a commutative ring with unit. The map y : R — 
R/I defined by setting y(a) = a+ TJ for all a € R satisfies (i) p(a +b) = 
(a) + y(b) for all a.b in R; (ii) p(a- b) = y(a) * y(b) for all a,b in R. 
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Proof. We already know that (R/J, +) is an abelian group and that the map y 
satisfies (a). Let us show that * is well defined. To do this we must show that if 
a = a'inod!/ and b = W’ mod/ then a-b =a’ -b’ mod 1. Now (a — a’) € I, hence 
(a —a’)-b€ 1. Also, because 6b — 0’ is in J, a’ - (b — 6’) € J. Thus we have both 
(a—a’')-b=a-b—a’'-binI and a’-(b- 0’) =a’'-b-a’-B in J. Adding we 
get a:b — a’ - bin I which says a-b =a’ - b’ mod J, and this is what we wanted 
to show. 

Observe that * is commutative for (a+ J) *(6+ J) =a-b+]=6-a+I= 
(6+ I) *(a+J) because (R,-) is commutative. Similarly, * is associative. Note 
that (a+ J) *{[(b+/)+(c+J)] = (a+ J) * ((6+c)4+/7) =a: (b+e) +] = 
(a-b+a-c)+I1 = (a-b+/)+(a-c+/), showing that (R/I,+,*) is a ring. 
The fact that (1 + J) is a unit in this ring and that » satisfies (ii) are immediate 
from the definitions. 


Corollary 1. For each fixed |, (Z;,+,-) is a commutative ring with nnit. 


Proof. Z, = Z/(\) and the set (1) = {kl | k € Z} is an ideal in(Z, +, -). Because 
this last ring is commutative and has a unit, the corollary is an immediate 
consequence of our theorem. 


Exercises 3 
1. We have seen that the ideals in (Z, +, -) are of the form (a) = {k-a|k € Z}. 


(a) Show that (a) C (6) if, and only if, b | a. 

(b) A proper ideal J in a ring (R, +, -) is said to be a maximal ideal if the 
only ideals that contain J are R and / itself. What are the maximal 
ideals in (Z, +, -)? 


2. We refer to Chapter Four, Section 9 and the discussion just before Lemma 
2. Let q be a fixed odd prime number and let X = {a + bV3 | a,b € Z,}. 
For a; + 6; V3 and a2 + 6273 in X define their sun: to be (a, + a2) + 
(b, + b2) V3, where a; + a2 and b, + b2 are computed in Zp. 


(a) Show that (X, +) is an abelian group. 


(b) Define (a) + b, V3)- (az + beV3) to be (a,a2 + 3b1b2)+(a1b2 + a2b)) 
V3, where a,a2 + 3b,62 and a,b2 + a2b, are computed in Z,. Show 
that (X,+,-) is a commutative ring with unit. 


3. Let (R,+,-) be a commutative ring with unit. For any fixed a € R let 
(a) = {r-a|reé R}. Show that (a) is an ideal in R. 


4, Show that, in a commutative ring with unit, every proper ideal is con- 
tained in a miaximal ideal (defined in 1(c) above). 
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4. Fields 


We are all familiar with (Q,+,-), (R,+,:), and (C,+,-). Each of these is a 
commutative ring with unit, but they are very special rings in that each of 
their nonzero elements has a multiplicative inverse. Such rings are called fields, 
and in this section we discuss finite fields, sometimes called Galois fields. These 
are of interest in pure mathematics, and they have proved useful in digital 
signal processing [see Note 2]. 


Definition 1. A field (F,+,-) is a commutative ring with unit in which every 
nonzero element has a multiplicative inverse. 


We have seen that (Z;,+,-) is a commutative ring with unit (Section 3, 
Corollary 1 to Theorem 1). Furthermore, by Corollary 1 to Theorem 3 (Chapter 
Four, Section 8), (Z;,+,-) is a field if, and only if, ! is a prime. 

Given a field (F, +, -) it is convenient to denote the additive and multiplica- 
tive identities of F by 0 and 1, respectively. If a € F, we set 1-a = a, 2-a = ata, 
and in general n-a = a+(n—1)-a, n = 3,4,.... Consider now the sequence 
of field elements 1-1, 2-1, 3-1,... There are Wo possibilities: We can have 
m:-l#n- i whenever iin. or we-can Linton: 1 = m-1 for some m,n with 
n <m. In the latter case we see that (m —n)-1=0. 


Definition 2. Let (F,+,-) be a field. If n- 1 #0 for all n € N, we shall say 
that F has characteristic zero. If F does not have characteristic zero, then thie 
smallest element n of N such that n-1 = 0 is taken to be the characteristic 
of F. 


It is clear that Q, R, and C each have characteristic zero and that Z, has 
characteristic p. 


Theorem 1. The characteristic of a field is either zero or a prime number. 


Proof. If (F,+,-) is a field having characteristic n > 0, then n is the smallest 
positive integer such that n-1 = 0. If n is a composite number, then n = pq 
where 1 < p< nand1<q<pn. It is clear, however, that 0 =n-1 = (pq)-1 = 
(p-1)-(q-1). If the field element p-1 # 0, it has a multiplicative inverse 
(p- 1)". Hence, recalling that (p-1)~' - 6 = 6 (Section 3, Lemma 1, (a)), we 
see that 0 = (p-1)7’-0 = (p- 1)~'- [(p- 1) (q- 1)] = q-1. But this is impossible 
because 1 < g <n and n is the characteristic of F. 


Now let (F,+,-) be a field with characteristic p # 0. Set 0:1 = 0 and note 
that {0-1,1-1,2-1,...} = {(n —1)-1|n € N} is just the set 
{0-1,1-1,...,(p— 1) i}. Furthermore, if m-1 and n-1 are in this last set, then 
m-1+n-1=(m+n)-1 and (m-1)-(n-1) =(mn)-1 provided that the sum, 
m +n, and the product, mn, are carried out mod p. 


176 Chapter Five mw Abstract Algebra 


Now we can define a map y : Z, — F by setting y(n) = n-1 for each 
né& Zp. We are using n to mean two different things. In y(n), n is an element 
of Z,, and in n-1, n is the integer n. This should not cause the reader any trou- 
ble. Introducing new notation to distinguish the two n’s seems unnecessarily 
pedantic at this point. 

Observe that g(m+n) = y(m) + y(n) and y(mn) = y(m) - y(n) for all 
m,n in Zp. We have shown that every field of characteristic p “contains” the 
field Zp. 


Theorem 2. Let (F,+,-) be a field with a finite, say g, number of elements. 
Then g = p”, where p is the characteristic of F, and n € N. 


Proof. It is clear, because F has only finitely many elements, that its charac- 
teristic must be p 4 0. Now F contains Zp, and we may regard F to be a vector 
space whose scalars come from Z, (Appendix A.) Let v1,...,Un be a basis for 
this vector space. Then every element z € F can be written in one, and only 
one, way as a linear combination of the basis elements. Hence z = jt Qjv;; 
where each a; € Zp. Note also that any such sum is in F’. Because there are p 
choices for each coefficient and there are n coefficients, F contains p” elements. 


Remark 1. It is true, although we shall not prove it, that for any prime p 
and any n € N there is a field containing p” elements. Furthermore, there is 
only one such field in that any two are isomorphic. As we mentioned above, 
such fields are called Galois fields. They are often denoted by GF (p”). 


Exercises 4 


1. Let (F,+,:) be a field and let a,b € F. If a- 6 = 0, show that either a = 0 
or b = 0. 


(a) If (F,+.-) is a field, show that the only ideals in F are {0} and F 
itself. 


(b) If (R,+,-) is a commutative ring with unit and if the only ideals in 
R are {0} and R itself, show that (R,+.-) is a field 
2. Let (F,+,-) be a field with characteristic p £ 0. 


(a) For any a,b € F, show that (a + 6)? = a? +6°. Hint: Use the binomial 
theorem and the fact that p is a prime. 


(b) Let G = {z? | x € F}. Show that (G,+,-) is a field. 
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Notes 


A very accessible introduction to modern algebra can be found in “Ab- 
stract Algebra” by J. N. Herstein (Macmillan Publishing Co., 2nd Edition, 
New York, 1990). 


. Groups, rings, fields, and their application to digital signal processing are 


discussed in “Fast Algorithms for Digital Signal Processing” by Richard 
E. Blahut (Addison-Wesley Publishing Co., Reading, MA, 1985). See pp. 
21-33. The Fourier transform in an arbitrary field is discussed on p. 31. 
There is also a treatment of finite fields (pp. 155-165). Euler’s phi function 
is discussed on pp. 156 and 157, and Mersenne numbers and associated 
finite fields are used on pp. 182-185. 


. There are many applications of group theory to various areas of physics. 


These are discussed in “Applications of Finite Groups” by John S. Lomont 
(Academic Press, New York, 1961). 


Appendix A 


Linear Algebra 


Here we shall recall some terminology froin linear algebra 


Definition 1. A nonempty set V is said to be a vector space over K if it has 
all of the following properties: 


1. There is a rule that assigns to each pair u,v of V a unique element of V 
called their sum and denoted by u+v. This rule is called vector addition. 


2. Vector addition satisfies 


(a) The Commutative Law; i.e., u+v =v-+u for all u,v in V. 


(b) The Associative Law; i.e., (u+v) + w= u+(u+w) for all u,v, 
in V. 


(c) There is a unique element 0 € V for which u +0 = 0 + u for all 
u € V. We call 0 the additive identity of V. 


(d) For each u € V there is a unique element (—u) € V such that 
u+(-—u) =(-u)+u=0. 


3. There is a rule that assigns to each pair a,v, wherea € KandveéV,a 
unique element of V called the product of a and v denoted by av. The 
rule is called scalar multiplication. 


4. Scalar multiplication satisfies. 


(a) a(Zu) = (af) u for all a, 8 in K and all ue V; 
(b) a(u+v) = au+av for all a in K and all u,v € V; 
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(c) (a+ B)u = av + Bu for all af in K and all u€ V; 
(d) lu =u for allue V. 


The elements of a vector space V over K are called vectors, the elements 
of K are called scalars, and K is called the field of scalars. 


Now let V be a vector space over K. 
Definition 2. A nonempty subset W of V is said to be a linear subspace if 


(a) For any two vectors u,v, in W the vector u + v is in W,; 


(a) For any vector w in W and any scalar a (i.e., any element of a of K), the 
vector aw is in W. 


Let S be a nonempty subset of V. A vector v is said to be a linear combina- 
tion of vectors in S if there is a finite set u),...u, in S and scalars a), ...a@, such 
that v = ~ a;uj. The set of all vectors in V that are linear combinations of 
vectors in np is denoted by lin S. It is easy to see that lin S is a linear subspace 
that contains S. We call it the linear subspace spanned by, or generated by, S. 
In particular, if lin S = V we say that S spans V. 


Definition 3. Let V,W be two vector spaces over the same K, and let T : 
V — W. Weshall say that T is a linear function, or linear map, if (i) T (u + v) = 
T (u) +T (v) for all u,v in V; (ii) T (av) = aT (v) for all a in K and all v in V. 


Corresponding to any linear map T : V — W we have two linear subspaces: 
the range of T (Definition 1 of Section 2, Chapter Zero), which is a linear sub- 
space of W; and the null space, or kernel, of T defined to be {uv € V | T (v) = 0}. 


Appendix B 
The Completion 


Definition 1. Let (£, ||-||) be a normed space. A Banach space (E,||-||,) is 
said to be a completion of (E, ||-||) if 


(a) There is a linear map T : E — E such that |r| = || for every 


TEE (we say that the map T is norm preserving); 
(b) The linear subspace T(E) = {Tz | re E} is dense in (E, lI-ll,)- 


We shall show that every normed space has a completion and that, in a 
certain sense, this completion is unique. The construction is quite straight- 
forward; it is similar to Cantor’s construction of the reals from the rationals 
but lengthy and a little bit tedious. We shall leave some of the more routine 
verifications to the reader so as not to obscure the underlying ideas. We begin 
with some simple facts about Cauchy sequences (Definition 3 of Section 2, 
Chapter Three) in a normed space (£, ||-||). 


1. if Zyy are in E, then |= _ ||| < | _ y|l 


2. If {zn} C E is a Cauchy sequence, then the sequence of real numbers 


| 


3. If {Zn} and {yn} are two Cauchy sequences in E and if. "™ (z, — Vn) = 


no 
lim 
0, then ,_.. 


Zin \. is convergent. 


— lim — 


In 


n e 


We read the proofs for the reader. 
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Theorem 1. Every normed space has a completion. 


Proof. Let (£,||-|]) be a normed space and let C’ be the set of all Cauchy 
sequences of points of E. We may define an equivalence relation (Chapter Zero, 


Section 2) on C’ by agreeing to say {zn} and {yn} are equivalent when 
In —¥n = (0. For each {zn} E€ C, let |{zn}] be the equivalence 


class (Chapter Zero, Exercises 2, Problem 1) that contains {zn}, and let E 


lim | 


be the set of all equivalence classes in C’. We can define two operations on 
the elements of E as follows: For any {z.}]. {an }] we let {z.}| + 


{un }] = [{Zn+ vn}]. and for any scalar A we let 4[{Z,}] = [{azn}]. 


It is easy to show that E, with these operations, is a vector space. 


We have seen that for any {zn} pe SP re im zal exists and gach 
Furthermore if {z a} and {y.} are equivalent, then |, sie ., (lz ll =. = 
((3) above). Thus we may define, for any aie EE, 


Mtl, = nw 20 lal 
n> © 


It is easy to see that l|-(|, is a norm on E. 


Now given any z € E we consider the sequence z ,Z,..3 ie., the element 
a} € C' such that as =z alln. Then, denoting this sequence by 


{7 
{7}.[{=}] € and ||[{Z}]l|_ = [=|]. We define a map T: E + E by 


setting Tx = z} for every z € E. It is clear that T is a well-defined linear 


map that is norm preserving (ie. 72] = =| for all z). 
Let us show that the linear subspace T(£) is dense in (£, ||-||,). Given 


{= }] € E the sequence {zn} is in C and so, given € > 0, there is an N 
such that 


whenever m,n > N. Choose, and fix k > N and consider the sequence Zk, Zk, ds 


[LJ] — [{Fe Fe J] = [{Fe— Fah 
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and 


[ACF =3] - [LF FH = MF Fh = alll 


lim | 
n—0Oo 


— 


Thus, as k — oo, the sequence 


(ears ene |e | cree |e 


converges to the given element of E for ||-||, . Because each term of this sequence 
is in T(E), we see that T(E) is dense in (E, ||-||,) as claimed. 


Note that we have just shown that for any {z.}] in E the sequence 


T21,TZ2,...T2,,... converges to {7} for ||-||,. We use this observation 


below. 
To finish the proof we need only show that (£; ||-||,) is complete. So let 


{Vn} be a Cauchy sequence in E and choose, as we may because T(E) is 


dense in E, v, € E such that 


—_— 


TiVo 


<-— 
i) 


1 


We do this for each n = 1,2.... Consider the sequence {on} contained in 
E that we have constructed. We note that 


1 
1 1 1 
1 = = 1 
n m 


1 


where we have used the fact that T is norm preserving. Because {V +} is a 


Cauchy sequence, our inequality shows that {vn} is also a Cauchy sequence. 
Set V = {on} and recall that Tv 1,T V2, ...T Ux, ... converges to V for lI-Il,- 
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It follows from the inequality 


Va _ TUn + 


| 1 


1 n 


Tin- v| 


| 


o——_ 


that the Cauchy sequence a is convergent to a point of E. 


While we have the proof before us, let us show that when (V, (,)) is an inner 
product space and ||-|| is the norm induced on V by the inner product, then 
the construction applied to (V, ||-||) gives us a space (V, ||-||) that is actually a 
Hilbert space. 

First note that when {vn} : {wn} are in C’ (the set of all Cauchy sequences 


of (V, ||-|[)) then 


(vn, wn ) —~< (Um; wm) as 


(en — Um, Wn - wm )| < | Un - v m||| 


Wn ap wm|| 


by Theorem 2 of Section 3, Chapter 3. Hence ae (Uns wn) exists in K. 


—* 00 


Furthermore, if {vn} and {wn} are equivalent, respectively, tol Un} and 
{wn}, then 
—_ _ —f/ —/ 
(en, wn) oe ( v nWn) 
showing that lim (Un; Wn ) = lim (Y'n Wn). Hence we may define, for any 


[{n}] ; |{ wn }] in V, an inner product by setting 


{0} -[{@}]) = tm (B) 


It is easy to see that the norm associated with this inner product is just 
||-l|,. Thus (V, ||-||,) is complete showing that (V.(,)) is a Hilbert space. 


Finally. we note that the map T: V — V defined in the proof of Theorem 


1 satisfies 
(Tu,Tv) = (u, v) 


—/ 


Wn- Wn 


_ —/f 
Un — Un 


< 


for all u,v in V. 
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We have proved: 


Theorem 2. Given any inner prodnct space (V, (,)) there is a Hilbert space 


(V.(.)) and there is a linear map T: V — V snch that (Tu,Te) = (a. v) 
for all u, ¢ in V (we say that T preserves the inner product), and the linear 


subspace T (V) is dense in V. 


Definition 2. Given an inner product space (V.(,)) a Hilbert space having 
the properties stated in Theorem 2 is called a completion of (V, (.)). 


Let ns now show that all completions of a normed or an inner product 
space are “essentially” the same. 


Theorem 3. Given any two completions of a normed space, there is a linear 
norm preserving map from either of these completions onto the other. Given 
any two completions of an inner product space, there is a linear map from either 
of these completions onto the other that preserves inner product. 


Proof. Let (E,||-||) be a normed space, and let (£), ||-||,) and (£2, |I-{],) be 
two completions of this space. Then we have two linear norm-preserving maps 
T, : E — E, and T, : E — Ey such that 7, (E) is dense in (Ej. ||-{],) and 
T2 (E£) is dense in (£2, ||-||,). To simplify the notation, let us simply say that E 
is a dense linear subspace of both (£), ||-||,) and (£2, ||-||,)- 


We may define a map T : E, — Eg as follows: For each r € EC E, 
we set T (7) = 7 € Eo. For y € £E,\E we find first a sequence {tn} CE 
such that limor, = y for lI-ll,. Then {zu} is a Cauchy sequence for ||-||, and 
{T Ge = rn} is a Cauchy sequence for |]-||.. We set T (7) eqnal to the 
limit of {T (=.)} for ||-ll>- 

Note that if {Zn} and {zn} both converge to V for ||-||,; {Zn = oa 
converges to zero for both ||], and |[-[].: because {Fn} and ‘ot are in E. 


Thus lim T (z.) = limT (2'.) (for I|-ll>). and we sce that. T is well defined. 
It. is easy to see that T is linear. Hence to show that. T is one-to-one (Chapter 


Zero, Section 2) we need only show that {y €E,|T(y)=0€ E,} contains 
only the zero vector of £,. Suppose that y is in this last set, and let {Fn} be 
a sequence in E that converges to y for ||-||,. But then lim In = lim T (7,) = 


0 € Ep for ||-||, and so, because [|], = ||-|, on E, lim sr, = 0 € Ey for ||-(l,. 
Thus y is the zero vector of E}. 
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Given any ze E2 we can find a sequence {zn} C E that converges to z 
for ||-||,. But then {zat converges to an element y € E for l|-I|,; because it 


is Cauchy for E,. Clearly then, T (y) = 2, and we have shown that the map 
T is onto. 
Finally, let y € £,, and consider | 


yl and Ir7@)]. We have T(y) 


~— 
In 


lin T (7, ) where In} C E and limz, = y for lI-I|,- But then lim | 
lI, and lim |Z. F = lim 7 (z.) ie |r (y) l, (Chapter Three, Exercises 


2, problem 1(a)). Because ||-||, and ||-||, coincide on E, we see that |r (v) |. = 


|| and we have proved that T is norm preserving. 
1 


Now let (V,(,)) be an inner product space, and let (V; (,),) and (V2 (,).) 
be two completions of this space. We regard V to be a dense linear subspace 
of the Hilbert space (Vj, ||-||,) and a dense linear subspace of the Hilbert space 
(V2, ||-l|,). As above we define a one-to-one, linear map T : V; — V2 that is 


onto. Let u.v be any two elements of V, and find {un} : {vn} in V such 


that lim u, = u and lim v, = v for lI-Il,. Then 


(r(a).7(8)), =1im(r (a) .7 (F)), 


= lim (Un, Tn), 


=tim (iin, Fn), = (3,7) 


showing that T preserves the inner product. 


Now that we have proved Theorem 3 we shall speak of the completion of a 
normed or inner product space. We shall also identify the space with its ”copy” 
in its completion. Thus we shall say that any normed space is a dense linear 
subspace of its completion. 


Appendix C 


Solutions to the 
Starred Problems 


Note: Throughout this appendix we shall write ”iff” in place of the phrase "if. 
and only if.” 


Chapter Zero 
0.1.2 We have defined (z, y) to be {{z}, {z, y}}. 


(a) (x.y) = (yx) ifr=y 
If x = y, then we have (z, y) = (z,z) and (y,z) = (z,z), and these 
are the same. Now suppose (z,y) = (y,z), then {{z},{z,y}} = 
{{y}.{y,r}}. We want to show that z = y. Suppose this is false. 
Then we have {z} € {{y}, {y,z}} and {x} 4 {y,z}, hence {x} = {y}. 
But this gives us z = y and we are done. 


(b) (z,y) = (u,v) if c = u and y = v. Assume (2, y) = (u,v). Suppose 
first that u = v. Then (u,v) = {{u}, {u,v}} = {{u}, {u,u}} = 
{{u},{u}} = {{u}}. Now {z}e(u,v) and so {zr} = {u}, giving us 
x = u. Furthermore, (x,y) must be a singleton (because, as we have 
just seen (u,v) is a singleton). Hence {x,y} = {<x} and because 
ye{z,y}, y = xz. Thus x = y = u = v in this case. Similarly. if 
r=y,thenz=y=u=v. 

Suppose now that z # y and hence u # v. Then both (z,y) and 
(u,v) contain one singleton: {z} and {u}. Clearly then, {x} = {u}, 
giving us u = x. Furthermore, each ordered pair contains exactly 
one unordered pair: {z,y} and {u,v}, hence {z,y} = {u,v}. Now 
ye {z,y} so ye {u, v}. If y = u, then y = z, which is false. Thus y = v. 
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0.1.3 X¥-Y=XNY 


Let xeX — Y. Then reX and z ¢ Y. Thus reX and z ¢ X NY, hence 
zeX —-XNY. Thus X —-Y CX —XNY. Now let reX —X NY. Then 
reX and x ¢ XN Y. Thus z cannot be in Y. So reX,z ¢ Y giving us 
reX —Y.ThusX —-XNYCX-Y. 


0.1.4 (a) Show (i) Ce(X UY) = Ce(X)NCgE(Y) and (ii) Ce(XNY) = 
Cr (X)UCE (Y). 


i. reCe(X UY) iff z f X UY iffe ¢ X andz ¢Y iff reCe(X) 
and xreCe (Y) iff reCe (X)NCe(Y). 

ii, ceCe(X NY) iffe ( X NY iffa ¢ X or xz ¢Y iff reCge_(X) or 
xeCp (Y) iff reCe (X)UCE(Y). 


(b) X-Y=XN[Ce(XNY)] =XNCE(Y) 
reX —Y iff reX and z ¢ Y iff reX and xreC Eg (Y) iff reX NCeE(Y). 
Thus X —-Y = XNCg(Y). By problem 1.3, X -Y=X—-XNY 
and by what we have just shown X —- XNY = XNCE(XNY). 
Thus X -Y =XNCge(X NY). 


(c) XUY = E iff {reE | z ¢ X} CY iff Ce(X) C Y. By symmetry 
XUY=EiffCge(Y)C X. 


(d) IfX CY andzeCe(Y), thenz f Y soz ¢ X; ie., if reCe (Y), then 
zeCe (X), giving us Ce (Y) C Ce(X). Now suppose Ce (X) 2D 
Cr (Y). By what has just been shown, Cg [Ce (X)] C Ce [Ce (Y)); 
ie., X CY. 


If X UY = Y, then for any zeX, zeX UY and so zeY. Thus X C Y. 
Conversely, if X C Y, then any zeX UY must be in X CY or in Y; ice., 
X UY CY. But clearly Y C X UY andso X UY =Y. 


If XN Y = X, then zeX implies reX NY and hence zeY. Thus X CY. 
Conversely, if X ¢ Y, then for any reX, zeX and xeY hence reX NY. 
Thus X C XNY. But X DXNY andsoX =XNY. 


0.2.1 (a) Because = is an equivalence relation it is reflexive (property (a) of 
the definition). Thus, for every zeX, z = x and so ze [zx] showing 
that this set is nonempty. 


(b) Suppose ze [xz] N [y]. Then x = z and y = z. Because an equivalence 
relation is symmetric, we have r = z and z = y (property (b) of the 
definition). But then by property (c) of the definition we must have 
r = y. It follows then that [z] = [y]. 


Now suppose |x] # [y]. Then « # y and because ze [z] M [y] implies, as we 
have just shown, that z = y, we must have (z] N [y] = ¢. 
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0.2.2 f :X — Y, g isa partial function from Y into Z. 


(a) 


(b) 


Suppose R(f) C D(g). Then for any reX, f (x) eR(f) C D(g) and 
so g[f (z)] has meaning; i.e., it is a unique element of Z. Thus when 
R(f) © D(g), g[f (x)] is a well-defined function from X into Z. 


Let f : X — Y and suppose that there is a function g : Y — X such 
that (fog) (y) = y for all yeY and (gof) (x) =z for all zeX. Then 
f is onto because it maps the element g(y)¢X onto the element 
yeY showing that R(f) = Y. It is also one-to-one. For suppose 


f (x) = f(x’). Then x = g[f (z)] = g[f (2")] = 20 


Now suppose f : X — Y is given to be one-to-one and onto. Then for 
each yeY the set pre; (y) is a singleton. We set g (y) equal to the unique 
element of this set. Then clearly g[f (z)] = 2 for every zeX. We also have 
Ff (9 (y)] = y, by the way g was defined, for every yeY. 


0.2.3 We have a set X and its power set P(X). 


(a) 


(b) 


We define, for Y,Z in P(X), Y = Z to mean there is a one-to-one 
mapping f from Y onto Z. Clearly, Y = Y for every YeP(X) for 
we may define f(y) = y for all yeY and in this way obtain a map 
f :Y — Y that is one-to-one and onto. Suppose that Y = Z. Then 
we have g: Y — Z, g is one-to-one and onto. By problem 2(b) we 
know that the map g~! : Z — Y exists and that it is one-to-one and 
onto. Thus Z = Y showing that our relation = is symmetric. 
Finally, suppose Y = Z and Z=W.Let f:Y ~Zandg:Z—~W 
be one-to-one, onto functions. Consider gof : Y — W. This map is 
onto because given any weW there is a zeZ such that g(z) = w, 
and there is a yeY such that f(y) = z, hence (gof)(y) = w. It 
is also a one-to-one mapping because (gof)(y) = gof (y’) implies 
f(y) = f (y’) because g is one-to-one, and this implies y = y’ because 
f is one-to-one. Thus Y = W showing that = is transitive. 


For Y, Z in P(X) define Y < Z to mean Y C Z. It is clear that for 
any YeP(X), Y C Y andso Y < Y. Thus our relation is reflexive. 
Suppose that Y < Z and Z < W. Then Y C Z and Z C W 
hence Y C W, which means Y < W. Thus our relation is transitive. 
Finally, if Y < Z and Z < Y, then we have both Y C Z and Z CY, 
which means Y = Z. So our relation is antisymmetric. 


0.2.5 We have a family of sets {Ay | AeJ} with A, C X for each Ae. Then 


(a) 


Cx [ver A] c= net OX (Ay) 
We have zeCx [,7, Ay] iff c ¢ UA) iff z ¢ A) for every Xel iff 
zeCx (Ay) for every Ae! iff ze.) Cx (Ay). 
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(b) Cx Ler Ay] = ner CX (Ay) 
We have reC'x [NA)] if z ¢ yep AX iff z ¢ A) for some Acl iff 


zeCx (Aj) for some el iff ze 2, Cx (Ap). 


0.4.1 (a) Given z =a+bi, 7 =a—bi and so z- 2 = a? — abi + abi — b7i? = 
a? + b? = |2|*. 
(b) If z=a+hi £ 0, then |z| # 0. Now z- (747) = l= 1, thus any 
z #0 has a multiplicative inverse. 


Suppose we have z-w = 1 and z- 2’ = 1. Then 2’ = 2’-1l=2'-(z-w)= 
(2’-z)-w=1-w=w. Thus the inverse of z is unique. 


0.4.3 We are given z = r(cos@+isin@) and w = s(cosy+isiny). 
(a) z-w=r(cos#?+isin6@) -s(cosy +isiny) 
= rs((cos@ cosy — sin@sin vy) + i (sin@ cos p + cos sin y)| 


= rs (cos (6 + y) + isin (8 + »)]. 


(b) We want to show that z” = r” (cos @ + isin @) for every natural num- 
ber n. Clearly, this is true when n = 1. Suppose that for some 7, it is 
false. Then {n ¢ N | z” # r” (cosn@ + isinn@)} is not empty. Let no 
be the smallest member of this set. As we have already noted, no > 1. 
Now 7o — 1 is a natural number and, from the way no was defined, 
zno-l = pro! [cog (ng — 1)8 + isin (no — 1) 6]. But then z"° = z- 
z™-! = r(cosO+isind) - r%—! [cos(no — 1)6 + isin (no — 1) 4), 
giving us, by (a), 27° = r™ (cosno@ + isinno@). This is a contra- 
diction. 


0.4.4 We are given two complex numbers, z, w. 


(a) z = re'® and w = se'®. Now |[e'?| = |cosa+isina| = (cos?a+ 
sin? a)! = 1 for any real a. Thus |z- w| = rs = |2|- |w]. 
(b) We want to show that |z + w| < |[z| + |w]. 


Note that |z+w|? = (z+w)-(zZ+w) = (z+w)-(%+w) and hence 
[z+ wl? = |z[? + 20+ 2-w + ||’. 

Now write z = re'®, w = se’, then z-W+zZ-w =rs [e'(9-¥) + efl¥-9)] = 
rs (cos (8 — y) +isin(@ — y) + cos(y — A) + isin (yp — 4)] = 2rscos (6 — ») 
because cos (9 — y) = cos(y — 8) and sin(@ — y) = —sin(y — 9). Thus 
z-w+zZ-w is the real number 2rscos (@ — y), which is clearly less than or 
equal to 2rs = 2|z||w|. Using this we have |z + w|? = jz/?+2-w+z-wt 
jw|? < |z|? + 2|z| lw] + {w|? = (|z| + |w])? and this proves our inequality. 
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0.4.5 Let p(z) be a polynomial with complex coefficients having degree n > 0. 
For any complex number r we may divide p(z) by (z —1r) to get p(z) = 
(z —r)q(z) + p(z). Here p(z) = 0 or degp(z) < deg(z — 1); i-e., p(z) 
must be a constant. Let r; be a root of the equation p(z) = 0. Then 
p(r;) =0, so p(z) = 0 and we have p(z) = (z — r;) q: (z). Clearly, q: (z) 
has degree n — 1. If n—1 # 0, then the equation q, (z) = 0 has a complex 
root rz. As before g, (z) = (z — r2) q2(z) where q2(z) has degree n — 2. 
Thus p(z) = (z —1)) (z — r2) gg (z). Continue in this way. The process 
ends when we have p(z) = (z —171)-:-(z —Tn) Qn (z) because then the 
degree of g, (z) must be zero; i.e., g, (z) must be nonzero constant. 


0.4.6 For any z =a+bi in C, u(z) =a and v(z) = b. Now let f : R — C be 
given. Then f (rz) =a+ bi = u(z) + v(z)i = (uof) (rz) + (vof) (x) i, and 
we may set g(x) = (uof) (x), h(x) = (vof) (x) for all zeR obtaining two 
functions from R to R such that f (rz) = g(z) + ih (z) for all z. 


0.5.2 We are told that f is differentiable at soeK. We want to show that f is 
continuous at this point; i.e., we want to show that Pasig f(s) = f (S80). 
The trick is to write 


f(s) = 


He) LE80)| (5 — 4) + (20) 


8S — So 


Taking the limit as s approaches s9 the quantity in the square brackets 
tends to f’ (so) and hence Beanie | (s) = f (so) as was to be shown. 


0.5.3 We are given f : R — C and we write f(z) = g(r) + th(z) where g,h 
map to R to R, (problem 0.4.6). 


(a) |f (x) - (uo + ivo) |" = |(g (x) — up) + i (A(z) — v0)|” = |g (x) - vol” + 
|h (x) — vo|?. Now if ™ f(r) = uo tivo, then _''™ g(x) = uo and 


Z-Zo xr—Zo g\z 
am A(z) = vo. Conversely, if limg(z) = uo and limh(z) = v9, 
then lim f (rz) = uo + ivo. 
(b) f is continuous on [a, 6] iff both g and h are continuous on this 
interval. This follows immediately from (a) and the definitions. 


(c) 


2 


J () ~ So) Soe ee g(x) — 9 (Zo) _ 
I—-Iq I—- Io 
I—-Io 


Thus if _'™ Lz) {20} = ug + ivo, then g and h are differentiable and 


z-—--Zy 


g' (x0) = uo, h’ (zo) = v0. Conversely, if g' (0) = uo, A’ (x0) = vo, then 
f' (zo) exists and is equal to uo + tuo. 
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0.5.7 (a) 


(b) 


(c) 


(d) 


lim 


Suppose ,,"'..5n = 8 and let {s,,} be any subseqnence of {s,,}. 
Then given € ss 0 there is an N such that |s9 — s,| < € whenever 
n > N. Also, there is a kg such that ny, > N when k > ko. It follows 


that [so — sn, | < ¢ whenever k > ko, showing that me Sn, = So. 


Suppose that {s,} is a Cauchy sequence. Then given € > 0 there 
is an N that is |sn — 8m| < € whenever both m and n exceed N. 
If {sn,} is any subseqnence of {s,,}, then there is a ko such that 
ny > N when k > ko. It follows that |sy, — sp,| < € whenever k and 
l exceed kg. 


Let {s,} be a Cauchy sequence and let {s,,} be a subsequence 
such that eae Sn, = 89. Then given € > 0 there is a ko such that 
lsn, — 80] < § when k > ko. We can also find an N such that 
ISn — Sm| < § when m and n both exceed N. Now if n > N we can 


choose k > kg so that nx, > N. Then 


€ € 
I80 — 8n| S [80 — 8] + l8m, — Sul << 5S +5 HE 


Hence see Sn = 8. 

Let {s,} be a Cauchy sequence. Then we can choose N so that 
lon — Sn] < 1 when m,n > N. In particular, we have |s,| < |s, — sv|+ 
lsw] < 1+ |sn| for alln > N. Let K be an integer that is greater 
than |s,| ,|s2|,...,|s|. Then we have |s,| < 1+ K for all n. 


Chapter One 


1.1.1 We have u = u(z,y) and r = rcos@, y=rsin8@. 


Ou _ Wudr , du dy _ Ou Py ey, 
Or anor | OyOr Oz dy 
ru _ osd Ofudz | Ou dy oar 0?u Ox , Fudy 
Ore °°" | O22 Or | Oydx Or Ozdy Or? Oy Or 
72 2 
= 54 cos? 8 + 2sin 8 cos 0 - : 7 Oe aa 29 


Or? Oy? 
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because 
O7u _ Pu 
OrOy OyOx 
Ou OQudr du Ou Ou 
nee ae = 3 ae (—rsin@) + ay (r cos 4) 
Ou du Ou . . O7u Ox O7u Oy 
302 ~ De (—rcos6) + By (—rsin@) + (—rsin@) ee OyOx 30 
Ou Ox O*udy 
epee!) Sad B8 4 
_ Ou _ ,du 2. 2,07 2 Oru 
= r cos rane +r sin O53 —2r Bin CO aor 
O7u 
+1 cos’ O55 
now 


Oru  1du 1 dtu _ Hu | Hu 
Or2 r Or r2002—— Ox? Oy? 


1.2.3 f : R — R is continuous and a > 0 is any element of P(f). We distinguish 
several cases: 


(a) a>O0 
—T |... 
Case 1. 0< 2x. Then 0 x x+a 
ZI+a Z+Q z a zr+a z 
ad a ae 
z 0 0 0 Qa 0 
Now 


[senate [peur ayau= f° p(w du 


a 


where we set t = u +a and used the fact that f (u+a) = f (wu). It 


now follows that 
z+a a 
L =f 
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Case 2. x < (0). Here we have two subcases: 


i. -x 0 a-Xx a 


~-r+a 0 a-z 0 a a 

E, “EXE AE aes 

-2© -—z 0 -zx 0 a-zr 
But 


[sede f suraau= fi saya 
hence / ” = i ° 


Migs ES ig! eh ys 4 
Again 
Lia, *E 


(b) a<0 
Then 


f r f 
zr-a y 0 


where we have set y = r — a. 


1.2.4 We have g(x) = acos Az + bsin Az, where a,b are complex numbers not 
both zero and J is a positive constant. 


It is clear that P(g) D> {n(2%)|ne Z}. and if a = 0 (or if b = 0), 
these two sets are equal. We want to show that these sets are equal when 
neither a nor 6 is zero. 


Let a € P(g), so that g(z + a@) = g(x) for all x. Then acosA (z+ a) + 
bsin A(z + a) = acosAz + bsin Az for all zr. A little algebra gives us: 


(a. cos Aa + bsin Aa — aj cos Az + [bcos Aa — asin Aa — 6] sin Ar = 0 


for all zx. 
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We need a simple result: 
Lemma. If Acos Ar + Bsin Az = 0 for all z, then A = B = 0. 


Proof. Set z = 0 to see that A = 0. Then Bsin Az = 0 for all z. Clearly, 
B = 0 (take, for instance, z = 55 ). This proves the lemma. 


We can now conclude that a (cos Aa — 1)+bsin Aa = 0 and that b (cos Aa — 1)— 
asin \a = 0. Multiply the first of these by a, the second by 6, and add 
the results to obtain (a? + b*) (cos Aa — 1) = 0. 


If a? + b? ¥ 0, then cos\a = 1; hence Aa = 2k7m and a = 2%, where 
ke Z. 


If a? + b? = 0, then a = +ib and clearly b 4 0 (otherwise we'd have both 
a and 6 zero). Thus when a = ib our equation becomes ib (cos Aa — 1) + 
bsin Aa = 0 or i(cosAa — 1) + sinAa = 0. Equating real and imaginary 
parts gives cos Aa = 1. The case a = —6i is similar. 


1.2.6 Given f : R — C there are functions g,h that map R to R such that 
f (x) = g(x) +ih(z) for all x (see Chapter Zero, Exercises 4, problem 6). 
We must show that (a) P(f) = P(g)N P(h). 


It is clear that P(g)NP(h) € P(f). Suppose a € P(f). Then f (z + a) = 
f(z) for all z, giving g(z +a) + th(r+a) = g(z) + th(z) for all z. 
Equating real and imaginary parts we see that a € P(g) anda eé P(h). 


(b) If f is periodic, then P(f) contains a nonzero member. Thus P (g) 
and P(h) both contain this nonzero number showing that g and 
h are periodic. To see that the converse is false, consider f (z) = 
sin 2rz + isin (27/2) z. Then P(g) = {n|ne€Z} and P(h) = 
{+ [ne z}. Clearly, P(g) N P(h) = {0}; hence P(f) = {0} and 
f is not periodic. 


(c) If f is continuous, periodic, and nonconstant, then f has a smallest 
positive period. 


We have seen that f is continuous iff g and h are (see Chapter Zero, 
Exercises 5, problem 3(b)). If f does not have a smallest positive pe- 
riod, then P(f) contains arbitrarily small positive numbers. But because 
P(f) = P(g)NP(h) € P(g) we see that P(g), and similarly P(h), 
contains arbitrarily small positive numbers. It follows from Lemma 2 of 
Chapter One, Section 2 that both g and h are constant, and this is a 
contradiction. 


1.2.7 Let f : R — C be continuous and periodic. Then f is uniformly continuous 
on R. 


Let ¢€ > 0 be given. We must show that we can find a 6 > 0 such 
that |r — y| < 6 implies |f (z) — f (y)| < ¢. Let a € P(f), a > 0. 
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1.3.1 
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Then by Theorem 3 of Chapter Zero, Section 5 we know that f is uni- 
formly continuous on (0,a]. Thus, for the given €, we can find 6 so that 
lf (z) — f (y)| < € whenever z, y are in (0, a] and |x — y| < 6. Clearly, we 
may suppose that 6 < a. 

Now let z,y € R be such that |x —y| < 6. We may suppose z < y. 
Consider{[na, (n + 1) a] | n € Z}. There are two cases: 


(i) na<zr<y<(n+1)a for some n. 
Then |f (y) — f (z)| = If (y — na) — f (x -— na)| < € because z — na 
and y — na are in [0,a] and |(z — na) — (y — na)| < 6. 

(ii) r< na <y < (n+ 1)a for some n. 
Then |f(y)-f(z)| < If (y)— f(na)| + |f (na) — f(z)| < 2 
because na and y are in [na,(n + 1)a] and |y — nal < 6, and na 
and zx are in [(n — 1) a,na] and {na — 2| < 6. 


Here 4 is a fixed positive number. 


(a) If ae** + be—** = 0 for all x, then a = b = 0. 

Setting z = 0 gives the equation a+b = 0. Differentiating and setting 
x = 0 in the result gives A(a — 6) = 0. Solving these equations we 
see that a = b= 0. 

(b) Set f (xz) = cye** + coe~** where c, and cp are constants not both 
zero. Suppose f (z + a) = f (x) for all z. Then 0 = c, (e% — 1) e** + 
co (e~° —1)e"** for all z. It follows from part (a) that 
c, (e°* — 1) = 0 = eg (e-9 — 1). If cy # 0, then e® = 1, giving 
us aA = 0. Because A > 0, a = 0. If co # 0, then e~®* = 1, and 
again we get a = 0. Thus P(f) = {0} and the function f is not 
periodic. 


1.3.2 Equation (7) of ee One, Section 3 is r?R” (r)+rR' (r)—n?R(r) = 


1.3.3 


Set r = e'. Then 42 = 48 . & or, demoting the derivative with respect 
to t by R, R=Re ot 


Now R = et [ZR’- 4) + RB’ get =e*R" +e'R’. Thus R- R= eR". 
We now have r?R” =r 2 le —2t (R- R)| = R—RandrR = et (e*R) = 


R. Using these equations (7) becomes R —n?R = 0. The general solution 
to this last equation is R = c,e™ +coe~™* and because r = e' we see that 
R=cr" + cer". 


We want to solve r?R” (r) +R’ (r) = 0. Again we use r = e'. We saw in 
the last problem (1.3.2) that e?"R” = R— R and e'R’ = R. Using these, 
our equation becomes R = 0. Hence R = c and R = ct + d where c and d 
are constants. Because r = e‘, R(r) =clnr +d. 
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1.4.1 We have a sequence {u,(r,0}°_, of functions that are continuous on 
GUT and harmonic in G. If {u, (1,)} is uniformly convergent over T, 
then {u,, (r,9)} is uniformly convergent over G U T. 
Given € > 0 we must show that we can find an integer N such that 
lun (7,0) — um (r,9)| < € for all (r,6) € GUT, whenever m,n > N 
(Lemma 1 of Chapter One, Section 4). Now from our hypothesis we can 
find, given € > 0, N so that |u,, (1,0) —u,,(1,4| < e€ for all 0 € T 
whenever m,n > N. Thus —é€ < uy, (1,0) — um (1,9) < € for all 6 € T 
because our functions are real-valued. Because they are also continuous 
on T, we have 


min max 
-E< ge Tita (1,9) — Um(1,9)} < hE pp itn ( (1,0) — um (1,0) } <e 


whenever m,n > N. But by Theorem 1 of Chapter One, Section 1 we 
have 


eo on (iat (1,0) - {uy (7.6) — um (r,6)} 


max 
< 
~ Lad E GUT 


on {tin (1,8) — tm (1,8)} <e 


Um (1,6)} < (, hee 
{un (r, 9) — um (r,8)} 


Seer 


whenever m,n > N. 


1.4.2 (a) Forneé RN, n fixed, uy, (r, 0) = r” (a, cosné + 6, sinn@) is harmonic 
in G, i.e., inside the unit circle. 


a = nr"! (a, cos0 + bp sin n8) , 

OPun n—2 ; 

Co i (n —1)r"~“ (a, cosn@ + by, sin nd) 
Oun : du, 
rT is (—na,, sinn@ + nb, cos n8) , 72 


= r" (—n?a, cosné — nb, sin n6) 

+ ae + = =n(n—1)r"~' (a, cosn6 

+b, sinn@) +nr"~? (a,, cos né + 6,, sinn@) 

— n*r"~? (a, cos nO — b, sin @) 
= [n(n-1)+n-—n?]r"~? (a, cosnd — by sin 6] 
=0 forall (r,A)EG 
(b) For any fixed p, 0 < p < 1, the series >>", p", ).-, np”. 0, -, n7p” 

are convergent. 
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Because these are series with positive terms, we can apply the ratio 
test. Clearly, 


prt (n+ 1)p"*} 
p mn np” n2 p” 


Because we are given that p < 1, all three series converge. 


(c) Suppose that for some M, |a,| < M and |{b,| < M for all n. Then 
u(r.6) = 0°, r” (an cosn@ + by sinn8) is harmonic in G, i-e., for 
0<@< 2x andr <1. 


|r” (a, cosn@ + b, sinn@)| < r"2M for every n. Because }¢r” is 
convergent, the M-test (Corollary 1 to Lemma 1, Chapter One, Sec- 
tion 4) shows that the series given converges uniformly in G. Simi- 
larly, each of the series 


Ser r™ (a, cosn6 + by, sinn§) = So art ee, 


2 
> sa " (an cos né + b, sinné) = yon (n — 1) r"~? (a, cosn 
+b, sin n0) 


0 n by n $e 
De 597” (an cosnd + by sind) = } nr” (—a, sin nd + by cos n6) 


2 
> oar (a, cos nO + b, sinn@) = a n*r” (—an cos nO — by sinn6) 


is uniformly convergent. Writing u, (r, 6) for r® (a, cos nO + 6, sin n8) 
we nay use Lemma 4 of Chapter One, Section 4 to obtain 


Ou 1du 1 d0u O7u, 1l10u, 1 Ou, 
or? ror +a L( Seti eta) 


which is zero by (a). 


1.5.2 If f € C,(T), then there is an M such that |a,(f)| < Mf and |b, (f)|] < M 
for every n. 


Because f is continuous on T there is a number yp such that |f (@)| < 

we for all 6 € T. Thus because a,(f) = + f"_ f (0) cosnOd6 we have 

a < vo < #(27) = ou f an n = 0,1,2,... and because 
=1"_ f (6) sinn@dé we have |b, (f)| < 24, n = 1,2... 
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1.6.2 Suppose that the sequence {a,}”_, is convergent to b. We claim that the 


1.6.3 


| ge} 


a; +a2+...+@,, 
73 


numbers o, = , n=1,2,... also converge to b. 


a, +aqg+...+a, —nb la, —b[ +... + la, — | 
lo — b| = | —_ “gc < mm 
n n 
Now given € > 0 we may choose N so that |a, — 6| < § when n > N. 


Then 


ja, —b] +... + Jan_; — 9 ‘ lan —b| +...+ lan — | 

n n 

2 Jar — b+... + lan-1—b] | (n= N+1)E 
~ n n 2 


lo-bl< 


We note that n= Nt < 1 and that by choosing n > N sufficiently large 
we can make the first term less than 5; because the numerator of this 
term is fixed. 


Km (zx) = 


° M ° 
O. 
Fi 
oe 
M a 
O. 
a 
a 
+ 
M 
©, 
a 


a a 


[meio + (m—1)e*= +(m—1)e"* +... + ml 


= - | e(O)z 4 (1 — =) et + ¢ — =) ae ae 
m m m 


l os m — l ei(m—l1)z 
m 


a m-—1l 
_ |k| tke __ = | —tkz 
= _ (1 = e- = » 1 - € 


k=-—(m-1) 


II 
reer 
—y 


+ 
3e-me_e——™ 


as was to be shown. 


For any f € C,(T) there is a sequence of trigonometric polynomials that 
converges to f uniformly over T. 


Given any n € N we can find (Corollary 2 to Theorem 1, Chapter One, 
Section 7) a trigonometric polynomial 7, such that |f (9) — Tn (8)| < + for 
all 6 € T. The sequence {7,,} chosen in this way converges to f uniformly 
over T; i.e. given € > 0 we choose N € N such that y < € and note that 
|f (8) — t (8)| < + <e for all 0 € T whenever n > N. 
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1.7.2 (a) Let 7 (6) = hei & je", Then f”_ 7 (6) e~'"9d0 = 0 unless n = n; 
for some j7. When n = n; the integral is 27. Hence the Euler formulas 
(Chapter One, Section 5) show that the complex Fourier series of 
T (0) is 7 (6). 


(b) Given 7 (py) = 0 5_, cje'™” = S05, cj (cosnjy + isinn;y) and, as 
in part (a), the tae formulas Tyo that this last sum is the real 
Fourier series of 7 (y). 


1.7.3 
1 1 
l-rei* 1-—r(cosw + isinw) 
1 (1 — rcosw) + irsinw 


~ 1 —rcosw — irsinw “(= rcosw) + irsinw 
_ l-rcosw+irsinw 
7 (1 ~ rcosw)? + (rsinw)* 
_ 1 —rcosw +irsinw 
~ 1 = 2rcosw +r? (cos? w + sin? w) 
1—rcosw r sin w 
~ | 2reosw +r? aah (; — — | 


oer 2(1— rcosw) 


id ~142Re | T= 2reosw +r? 
~1+ 2rcosw — r? +2 — 2rcosw 
1 — 2rcosw + r? 
1-—r? 


1 -- 2rcosw + r? 


1.7.6 If f €¢ C,(T) and c, (f) = 0 for all n, then the partial sums, and hence the 
Ceraro means, of the Fourier series of f are all identically zero. Because 
these means converge to f nmiformly over T, f must be identically zero. 


Chapter Two 
2.3.1 


; : z : 1 
lim cz _ iim 5 lim 5 


r—O02sinkz 2—O0sin(Z) x — 0 }cos (2) 
_ 0+h)-—f(0+0 
7,(0)= ,lim £O+h)= F040) 


where f (0 + 0) = 


=] 


lim f (0 + h) 
h — 0+ 
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and in this case f (0+ 0) = 1. Thus 


h 
2 = 

— sin (4) lim 4-sin (4) 

fi. (0) = fo ger (ah 
—0 h h—0 hsin (3) 
_ lim __3-3¢08(3) 
h—0* Feos(H) + sin @) 
lim + sin (4) 


= ms = 
~ h— 0+ = sin (4 a)y+3 1 cos (8) + $.cos (4) 


Similarly, f’ (0) = 


2.3.4 (b) We are given f uniformly continuous on (a, 6). 
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i. Let {z,} be a Cauchy sequence in (a, b). We claim that {f (rn )} 
is a Cauchy sequence. Let a > 0 be given. Because f is uniformly 
continuous, we can choose 6 > 0 so that |x — y| < 6 implies 
lf (x) — f (y)| < €. Because {z,} is a Cauchy sequence, we can 
find N such that |z, —-2m| < 6 when m,n > N. But then 
lf (tn) — f (2m)| < a@ when m,n > N, and we are done. 

ii. Now let {z,} ¢ (a,b) and suppose ae = a. Because 
{f (rn)} isa Cauchy sequence it converges to, say a. We want 
to show that Pea nod (a +h) =a; ie., given € > 0 we must show 
that there is a 6 > 0 such that 
lf (a +h) —a| < ¢ when 0 <h < 6. First use the uniform conti- 
nuity of f to find 6 so that |f (x) — f (y)| < § when |r — y| < 6. 
Now |f (a+h) —al < |f (a+h) — f (tn)I+/F (Zn) — a]. Choose 
N such that a < ty < a+h and |f (zy) -—al| < §. Note 
that when 0 < h < 6 we have |(a+h) —zy| < 6 and hence 
If (a +h) — f (zn)| < §. Thus 0 < h < 6 gives |f (a+ h) — al < 


€ and we are done. The proof that nn ,f (b—h) exists is sim- 
ilar. 


Chapter Three 


ab 


3.1.1 For any Z, y in E, r=xc-yty and so [| < = - || + [ly] or 


[#l|— [|| < [=v], Because uy =v - = +, [ull - [|| < [¥ - 2) 


But || - all = |(-0 (v2) | = Wnty — 2] Thus | - | i 
greater than, or equal to, both ||| - eal and 3 - || , and so we 


have 


T= lla < |? - ¥ 


202 Appendix C am Solutions to the Starred Problems 


3.1.2 The vector (1,1) € C? has the property that |1|* + |i]? = 2, whereas 
1? +77 =0. 


3.1.3 (a) Let y € 6, (Zo). Then ||zo-y < e. Let us take 6 to be less 


(b 


(d 


~~” 


= 


wee” 


than the smaller of the numbers ||z9 — ull. E— zo —- al. Note 
that with this choice 6 + ||zo — y < €. Suppose||7 - | < 6. 
Then lz — Zo| < Ig - y| + ly - o|| <d6+ ly - Zo|| < eé. Thus 
05() ¢ & (Zo). 


Suppose y ¢ 2 (Zo). Then y= Zo|| > e. Let us take 6 < 
Ie = Zol|-e and consider O; (y). If z € Os (v), then |z = Zo|| ae 
|(z-9) ~ (zo-y)| and this, by 3.1.1, is 


> ||(#0- v) - (2 - oD I — el - (|v - Fol -«) [>< 


Thus Os (v) NB (Zo) =0 = 


Let {Ogla € I 1} be a collection of open sets aud let O = (2, Oa. If 
xz €O, then zt € Og 1 for some a € J. Because Og is open eee 
€ > 0 such that O -(z) C Og. But then O, (z) C O, showing that O 
is an open set. Now let {O;}" j=1 bea finite family of open sets and let 
O =Nj=,0;.- If z € O, then z € O; for every j. Thus because O; is 
an open set, we have €; > 0 such that O, ,(z) € Oj, and this is true 
for j = 1,:2,.5.,7: Let € = min {€;}5_ ,; Then O, (z ) oO, (z) CO, 


for j = 1,2,,n. Hence O, (z) Cc O. If O = Q, it is an open set by 
definition, for it is a neighborhood of each of its points. 


A set is closed iff its complement is open. Let {C, |aeEI} bea 
BO eee let C = eo Ca: Then E-C = E- aepca = = 
ac; (E — Ca) (Chapter Zero, Exercises 2, problem 5). Because each 
Ca is closed, each E — C, is open and so by (c), E — C is open. 
Thus C is closed. 


Now let {C;}5_, be a finite family of closed sets and let C = U?_,C)j. 
Then E-C = E- Us_ Cj = Nj, (E — C;) by (Chapter Zero, Exercises 
1, problem 4). Because each of the sets E — C; is open, so also is E —C 
by (c). We conclude that C' is a closed set. 
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3.2.1 (E,||-||) and {zn} C E given. and {zn} converges to Zo for all ||-|). 


(a) 


G 


In This last quantity 


By 3.1.1 we have o|| - 
tends to ZeTO aS N —+ 00, and the first quantity is nounegative. Thus 


Given € > 0 we choose N so that |/z9 — znl| < § 2 
Then ||z, — Zm| < (=. — Zo) + (Zo - im)|| < | Lo = In 


| zo Sw on | <eéewhenm,n>N. 


< |?o-2.| 


lim 
n—-00 


In 


3.2.2 S is a nonempty set, S’ the set of adherent points of S, S=SuUS". 


(a) 


Io € S’ iff there is a sequence of distinct points of S that converges 
to Zo. If {in} CS, Sn # Sm when n 4 m and ve ace = 0, 


then given € > 0 there is an N such that Sn E O, (0) whenever 


n > N. Clearly, {Sn |n> n} is an infinite set and because € > 0 
is arbitrary we see that Zo € S’. 

Now suppose that Zo € S’. We can find s,; € O; (Zo) NS. Next 
we choose s2 € O}; (Zo) NS and sy # 31. Then choose 33 € 
O; (Zo) AS such that s3 ¢ { 31. 3a}. Continue in this way. The 
sequence Sn so obtained consists of distinct points of S, and 


clearly this sequence converges to Zo. 


We shall solve this problem in stages. S is closed iff S’ C S. If S is 
a closed set, then, by part (a) and Lemma 1, S‘ C S. Now suppose 


S'S. Let {3 n} CS and let ,,\ ae ™ Sn = y € E. To prove that 


S is closed we need only show that y € S. There are two cases: If 
infinitely many of the s,, are distinct, then every neighborhood of 
y contains infinitely many points of S. Hence y € S’, but S'’C S 
SO y € S in this case. In the second case only a finite number of the 
Sn are distinct. But then there i is an N such that Sn = §$ 3 n+) for all 
n > N. Because lim s —— y: it follows that y Ee S. 


(S’)’ C S’ and so S’ is a closed set. 
Let Vo € (sy. By (a) there is a sequence {yn} ¢ C S’ such that 


lim — 
n—oc Un 


that O, (vo ) contains infinitely many points of S; this shows that 


oo 


= Y, and the y,, are distinct. Given ¢ > 0 we must show 
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(d) 


(a) 


Vo € S’. Choose N such that vo _ ¥n| < §. Then the set of all 


3 € S such that ||5 - || < § is infinite because yy € 5’. But 


*— voll <|| — val] + [vw ~ vol] <e 


for any such s, | 


S is a closed set. If C is closed. S C C, then § C C. Let yg € 
(SUS) and let {yn} C SUS" be a sequence of distinct points 


that converges to Vo: Infinitely many of the Un are in S, or infinitely 
many of the y,, are in S’. In the first case y, € S’, and the second 
case yo € (5’)’. 


Now let. C be a closed set that contains S. We need only show that 
S’ C C; but this is immediate from (a) and Lemma 1. 


Let H be a linear subspace of E. Then H is a linear subspace. Given 


y € A and A.€ k we must show y € H. Clearly, we may assume 
\}#0.Now y € H=HUH' If y € H, we are done. If y € H’, 


we can find {iin} < H such that Rn = y. Given e > 0 


m—- OG 
— 


there is an N such that ||h, — y 


and because every Ah, € H, Xy € H'C A. 


< IT for alln > N. But then 


= a|| < € forall n > N. Thus lini Nx = \y 


i 


Now let Z, y e€ H. We must show that rt+y isin H. If z and y are 
in H there is nothing to do. Ife H, y € H’, we choose {Un} CH 


such that lim Yn = y. Then lim (z + Vn) = x + y and because 
(Z+y,) € H for all n, (z+y) € H’. Finally, if z,y € H’ 
choose {zn} CH {yn} C H such that lim Zn = x and lim Un = 
y- Clearly, (z, + Vn) € H for all n, and lim (zn + Un) =x+y. 


For any u € V we have (u.0) = (u.2-0) = (2-0.u) a 
2(0, u) = 2(u, 0), If cc 0) # 0, then we could divide and 


obtain 2 = 1. Note: £0 a allo. so0+0=0, so 2- 0=0. 


Chapter Three 205 


(b) 
ju + | +[fe-of =(u+¥, u+v)+(u-v, u-v) 
=(u,u)+(u,v)+ (oyu +(¥,0) 
(u,u) + (-v,u) + (u, a 
+(-v,-9) 

= a+ [e+ fel +P 
=2(|jal'+[>]) 

(c) 


v,UvU 


a+ ol =(u+v,u+0) =(u,u)+ (u,v) + (eu) + (0,2 


2 34/2 
= [el + [P| 
because (u,v) = (v,u) = 0. 


— 


(d) c+] - - fe - of =2 (0,0) +2(8, 22) as we see from (b). 


—_> —_ 2 —_> _ _ —_ 
Now | +iv|| = ( +iv, utiv) = (u, u)- i(u,v) +i[(v,u) 


-i(8,3)] = fff -2(,3) +4 (8,3) + fal 
and oo i[a +s = iff’ + (3,8) (8.3) + aff 


Also 


ecco =(u-iv,u-iv) =(u,u +i(u, v)-i[(v, u)+i(e 


= [ll +#(@. 8) (8.2) + [ef 90 fe — aa 
‘ =— 2 —_ —o_ . 
= ile - (#8) + (ee) + aa 


— —_h 2 -_ eo 
Thus é||u + iv - iv - |" = 2(u,0 -2(0,u 


*)| 
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and so 
— é —_> — 2 —> _— 
|e +f) -|ja- af dt -ilfu - i 4 (u,v) 
4 
- (2.3) 


3.3.3 {in} and {vn} are sequences in V that converge to Uo and vo. 


— 


(a) For any w € V, lim (tin,w) = (to, w). We write 


(imi) ~ (aaa) = [in So)| [in —Zl f b 
C.S.B. inequality (Theorem 2). 


Un— wo|| = (0, we are done. 


—_ 


(b) lim (tin, Un) = (Vo, vo) . Here we write (tins Un) — (uo, ¥o)| = 
(aan) ~ (im) + (m0) ~ (a | 
(an Ba ~ Ha) + (Sn -i,30)| 5 [aba He) 
+| (nial fil fn — Fol +n ~ aa 
n- wo|| — 0 as n — co so the last term tends to zero. Because 
{un} is convergent the set { Un [n= Wee is bounded; i.e., there 
is an M such that |/unll < M all n; 
ia ta + fe 


Un — Vo — 0 we see that the first term in our inequality tends to zero. 


—> 
Un 


tn - Uo + wo < “ial = 0. Be Because 


3.4.1 (a) Sis an orthonormal subset of V. We claim S is maximal iff (a, v) = 


0 all v € S implies u = 0. 
First suppose that S is maximal and let ue V, (u,v) = 0 for 


all v € S. a # 0, then S$su{ s }, and this last set is 


orthonormal. This contradicts the maximality of S. 
Now suppose that the orthonormal set S has the property stated. If 
S C S’, where S’ is orthonormal, we must show that S=S’. Suppose 


u € S'—S. Then «|| = 1 because u € S’, but we also must 


have (u,v) = 0 for all v € S. This implies u“u= 0, which is a 
contradiction. 
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(b) Any z = {z;} 5", such that Cz, €n) =0 for all n must be the zero 


vector because (z, en) = z, for each n. Thus, by (a) the set {en} 
is maximal. 


(c) Let f € C(T) and suppose that ( f, 5==) = she J, f (t)e7intdt = 
0 for every n. Then all the Fourier coefficients of f are zero. It 
follows that the partials sums of the Fourier series of f, and hence the 
Ceséro means of this series, are zero. Because these means converge 
uniformly to f (Theorem 1 of Section 7, Chapter 1) we see that f 
must be identically zero. 


3.5.1 The linear span of the set {e'"*} in L' coincides with the set of trigono- 
metric polynomials (see Chapter One, Section 7 just above Corollary 2). 
We want to show that this set is dense in L' [—7, 7] for ||-||,; ie, we want 
to find, for any given f € L' [—7, 7], a sequence{o,,} that converges to f 
for the L}-norm. Now C (T) is dense in L' [—7, 7] and so, for the given f, 
we can find {g,} C C(T) such that {g,} converges to f for the L'-norm. 
For each fixed n we can find o, such that _™*. |gn (t) — on (t)l < 345 
(Chapter One, Section 7, Corollary 2). Clearly, |\gn — on||, < + for each 
n. But then ||f — onll, < Ilf — 9nlly + Ilgn — onll, < If — 9nil, + = and 
we see that {o,,} converges to f for the L}-norm. 


3.5.2 For notational convenience let us set Up, = —, for each n. By Theorem 
—. oo 
5, {vn} is an orthonormal basis for L? (T). Consider the sequence 
— oo 
2. oe 
{Die Brn Un} P C L?(T). Because >”, | Gn |? is convergent, this 


sequence is Cauchy (see the proof of Corollary 2 to Theorem 1 in Section 
4). Thus }>°° Bn Un converges to an element f € L? (T). But then, by 


the discussion at the very beginning of Section 4, we must have ( f, Un) = 


G i) = $B, for every n. 


Chapter Four 


4.1.2 (a) (a*b)(n) = }>,, a(n — m)b(m) = D>, a(k) b(n —k) = (b* a) (n), 
where we have set k = n — m (remember 7 is fixed). 
(b) A(a*b) = A{(a*b) (n)} = A{ Lp, a(n — m) b(m)} = 
{>> Aa(n — m) b(m)} = {55 a(n — m}-Ab(m), and the first of these 
is Aa * b, whereas the second is a * Ab. 


(c) [a* (b*c)](n) = 97, a(n — m) (b*c)(m) = 
dom a(n — m) (0, b(m — k) c(k)) = 
> C(K) (3, @(n — m) b(m — k)) if we let 1 = m — k this becomes 


208 Appendix C am Solutions to the Starred Problems 


don c(k) (XU, a(n — (L+ k)] (1) = 
din (kK) (90, 2 [(n — &) — Yb(0)) = DU, (a * 8) (n — k) c(k) = 
[(a * ) * c](n). 
(d) [a+ (b+c)|(n) = 7, a(n — m) (b+ ¢) (m) = 
yom a(n —m) {b(m) + c(m)} = do, a(n — m)b(m) + 
=, a(n ~_m)c(m) = (a*b)(n) + (a*c)(n) 
(e) Suppose a*c = c*a =e anda*b=b*a=e. Then b= bec = 
bx (ax*c) = (b*a)*c=(e)*c=C. 


4.1.3 |laeby—a* boll, = lla* (bn — boll, << llall, Ilbn — boll, Because 
ue llbn — boll, = 0, we see that ""a*b, = a * bo for every fixed 
aeél'(Z). 


If we define T,, (b) = (a * 6) for all b € l', where ae I! is fixed, then T, 
is linear by 4.1.2 parts (b) and (d). This map is also continuous because 
Ta (BDI, = lla * bl}, < Hall, [Olly for all Be 2. 


4.1.5 (a) g(1) =1, g(n) =O alin #1. Then g*g(n) =}),,9(n—m)g(m), 
and this is zero unless m = 1 and n —m = 1. Thus (g *g)(n) = 1 
when n = 2 and is zero when n # 2. Suppose g*(n) = 1 when 
n = k and is zero when n # k. Then g*t!(n) = g* g*(n) = 
ym g(n— m)g* (m) is zero unless m = k and n—m = 1. Thus 
gtt! (kK +1) =1, and g**! (n) =O ifn AK +1. 

(b) g-'(n) = 1 when n = —-1, g~!(n) = O when n # —1. Then g * 
g-'(n) = 4, 9(n—m)g~'(m), and this is zero unless m = —1 
and n—m = 1. Sog*g7! (0) = 1, and g*g7! (n) = 0 for n # 0; i.e., 


g*g '! =e. Now g7! *g(n) = 5g! (n—m)g(m) is zero unless 
m= 1andn—-—m=-1.So g™!*g =e and we have shown that 
g~' is the inverse of g. 

(c) (g7})° (n) = (g~' *g7') (n) = Do g7' (n—- m) g7' (m). This is zero 
unless m = —1 and n— m = —1. So (g-')’ (n) = 1 when n = ~2 
and is zero when n # —2. As in (a) we see that. (g-)* (n) = 1 when 
n = —k and is zero when n # —k. 

(d) a € l' soa = {a(n)}. Also D™a(n)g” = ... ta(-l)g7' + 


a(0)g° +a(1)g' +... = {a(n)}™, and this is just a. 
4.2.1 We have a € 1! (z) fixed, I (a) = {a*b|bel'}. 


(a) Because a+ e =a, a € IJ (a). Clearly,a*b+a*c=a*(b+c), and 
A (a * b) = ax(Aa) by 4.1.2 (b) and (d). Thus J (a) is a linear subspace 
of !'. To show that J (a) is an ideal, we must show that when c € I (a) 
and d € l', d*c € I(a). Now c € I (a) and so c = a * b for some 
bel'. Also d*c=d*(a*b) = (d*a)*b = (a*d)*b=a*(d*b) 
by 4.1.2 (c) and (a). 
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(b) We want to show / (a) is a proper iff a is not invertible. First suppose 
that a is invertible. Then there is an a~! € /' such that a* a7! = 
a~! «a =e. Hence e € I(a). But then b = b«e € I (a) for every 
b € l' showing that J (a) is not proper. Thus when / (a) is proper. 
a cannot be invertible. Now suppose that a is noninvertible. Then 
e ¢ I (a) for, otherwise, e = a*xb = b*a for some b € I’, which would 
mean a is invertible. Thus J (a) must be proper. 


(c) As discussed just before Lemma 1 we need only show that the closure 
of an ideal is an ideal. Let J be an ideal in l' and let J be the closure 
of I. Then J, and hence J (by 3.2.2 (f)), is a linear subspace of l'. 
Let a € I and let b € l'. We must show that b «a is in J. If a € J, 
this is immediate. If a € \J, then there is a sequence {a,} C J such 
that img a, = a for the /' norm. But then b*a,, € J for all n, and 
nm * inst = bx a for the l'-norm (4.1.3). Thus 6* a € I and we are 
done. 


4.3.4 Suppose that for some e € L)} (R) we have ex f = f for every f € L' (R). 


4.4.1 


Then e *e = e. Hence é(z) - é(z) = €(z) or é(z) (E(x) — 1] = 0 for all 
z € R. Now é(z) is a continuous function (Lemma 1), so either é(z) = 0 
for all x or é(xz) = 1 for all x. The first of these would mean that e is 
the zero element of L' (IR) and in that case e« f = 0 # f. The second 
possibility can be ruled out because ese (x) = 0 by Lemma 2. 


We refer to the statement of Lemma 2. Here M is a multiplicative group. 
Because M # @ there is an x € M. But then x-z7~! = 1 € M and we 
have proved (i). To prove (ii) we note that for any x € M, 1-27! isin M. 
When z, y are in M, xz and y™! are in M by (ii). Hence z- (y)7" =r-y 
is in M. It is now clear that 2” € M for every n € Z. 


4.4.2 f : M — A where M is a multiplicative group and A is an additive group. 


Furthermore f (z-y) = f (x) + f (y) all z,y in M. 


(a) For any rx € M, f(z) = f(z-1) = f(x) + f (1) showing that 
f (1) = 0 and hence 0 is in the range of f. Also f (1) = f (x- 27!) = 
f (x) + f (x71), so f (a7!) = —f (2). 
Now let a,b be in the range of f. We must show that a — 6 is in the 
range of f. There are z,y in M such that f(z) = a and f (y) = 6. 
Hence f (r-y~') = f (x) + f (y"') = a+(—b) and we are done. 

(b) {x € M | f (x) = 0} ¥ @ because it contains 1 (see (a)). If z, y are in 
this set, then f (z-y~') = f(z) +f (y"') = f (x) + [-f (y)]. But 
both f (x) and f (y) are zero, so x- y~! is in our set. 

(c) Uy = {+1, +t}. Note that i=} =1= == = —i and (-i)}= =f = 
i. 
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(d) f:Z2—- U4, f(n) =i" all n. Then f (m+n) =i"? =i™.i7 = 
f (m)-f (n). For positive n, i” = i#9+" = (i4)"-i" = i” because i4 = 
1. Here 0 < r < 480i" is 79, i!, i? or #3. All of these, 1° = 1,i,i? = -1, 
and i° = —i, are in U4. So the range is Uy, because i-" = ay and 
this is in U, for each positive n. When n = 0, f (rn) = 1 € U4. 
{ne Z|f(n)=1} = {neZ|@=1} = {neZ|it™=1} = 
{n|n=4q+r, and r = 0} = all integer multiples of four. 

(e) g: Z + U4, g(n) = (-1)”. Here the range of g is {+1,-1} and 
{n € Z| g(n) = 1} = all even integers. 


4.4.3 Here n is a fixed positive integer and U, = {z € C |z” = 1}. 


(a) If z,y € Un, then 2” = 1 and y” = 1. Also (y~')” = “a = 1. Thus 
(x - yo)" =2". yt)” = 1 and hence r-y™~! is in U,. If Z" = 1, 
then 1 = |Z"| = |Z|" by 0.4.4 (a) . Thus |Z] = 1 because the only 
positive real number whose nth power is 1, is 1. 

(b) If p(x) = 0 has a root r of multiplicity & > 1, then p(zr) = 
(x —r)’ q(x). But then p' (zx) = (4 - r)* q(x) + k(x - r)*—! ¢(z) 
by the product rule. Because k > 1, k — 1 > O and we see that 
p' (r) = 0. 


Now set p(x) = x" — 1. Because p’ (xz) = nx"—? its only solutions are 
xz = 0 (with multiplicity n — 1). Because zero is clearly not a root of p(z), 
p(z) and p’ (rz) have no common roots. Thus all roots of p(:c) are simple. 


4.4.5 (a) Clearly, {i" | ne Z} = {i®,i',i?,i3} = Uy, so 7 is a generator. So 
also is —2. 
(b) For n fixed. es = cos 25 + isin an is clearly in U,; > (e%2=)" I 
e2™ = cos27 + isin 27 = 1. But the n numbers el* ) ,O<k<n 
are in U,,, and they are distinct. So we have a generator of U,,. 


(c) The number 1 is a generator of the additive group Z. 
4.5.1 M is a multiplicative group; N is a subgroup of M. 


(a) For s,t € M, s = t modQN is to mean st! € N. Because N is a 
multiplicative group, 1 € N (Section 4, Lemma 2). Thus for any 
s € M, s-s~' € N and so s = s modN. Suppose s = t modN. 
Then s-t-! € N. But then (s-t-!)~" € N (Section 4, Lemma 
2) and so t-s~' € N. It follows that t = s modN. Finally, let 
s =tinodN, t = u modN. Then st~! and tu~! are in N. But then 
(st~') (tu~') = su7! is in N, and we see that s = u modN (again 
we used Lemna 2 of Section 4). 
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(b) Let s’ € [s], and ¢’ € [t]. Then s’s~' € N and t't"' € N. Hence 
(ss!) (t't~1) = (s't") (st)~' € N showing s’t’ = st modN. 

(c) For any s € M, [s] + [1] = [s- 1] = [s] = [1- s] = [1] * js]. 

(d) For any s € M, s~' € M (Lemma 2, Section 4 of Chapter 4). Hence 
[s] * [s~'] = [s~"] « [s] = [ss~"] = [s—'s] = [1]. 

(e) For 2,22 € C\ {0}, 2 = zg modN means 2 - 22 le N, which 
means 2 € N. Now 2 = rye! 2 = rge2, and N =T (the unit 
circle). So a= ae —9) and this will be in T iff 2 = 1. So our 
equivalence Classes are soneentric circles centered at the origin. 


4.5.4 C is a subgroup of the additive group A. y: A — 4 is defined by setting 
y (a) = [a} alla € A. Then y(a +b) = [a + b = [a] + [b] = (a) + 9 (0d) 
for all a, b € A. Also {a € A| y(a) = [0]} = {a € A} a =0 modC} = 
{ae A|aEeC}=C. 


4.6.1 (a) Let r = > ~pwiand so wr = Diao +l Now we may 2 the 
summands as follows: r = w° + w! + w? +w? +... +0!-2 4 !-! and 
wr=wi +w% tut... tul 2% 40! +l, 

Observe that they differ only in that w® is in the sum defining r and 


w! is in the sum-defining wr. But w® = 1 = w! so they are equal. 


(b) w® —1 = (w—-1) (w" |! +w"-? +... +w +1). Now if w” = 1, then 
at least. one of the factors on the right must be zero. But w # 1; 
hence the second factor must be zero. 


a0 2 Ne nae et (a) w! =la#@e wil (b) wa = |w|? = 1 and 
so w = +; (c) the order of each w € U; is a division of | (Definition 5, 


Lemma 4). When | is a prime every member of U,, except +1, has order 
1. Thus every such member of U;. is primitive l‘"root of unity. 


4.8.2 If @,b are in Z, and @-6 = 0, then if @ has an inverse (a)~' we would have 
(a)~* (a-6) = (@)"*-6 = 0. But (a)~*- (a-6) = l(a)" a] -b=1-6 
giving us b = 0. Similarly, 6 cannot have inverse. 

4.8.3 Suppose ka = kb modn and (k,n) = 1. Then k-& =k-b in Z,, because 
(k,n) = 1, & has an inverse (k)~* € Z, (see Theorem 3). But then 


(k)~" ° (k - a) = (3 ; (k-b), giving us @ = b or, equivalently, a = 
b mod n. 


4.9.2 We have n = p-g where p is the smallest prime factor of m. Here g is a 
prime or, if not, g has a prime factor q,. Clearly, p < q, because q, is a 
prime factor of n. But then p? < pq, < pq =n. 
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4.9.5 If n is composite, then n = pq where 1 < p < n, 1 < gq < n. Thus 
on 1 = 29] = (2°)? 1 = (2P-1) (2°)? + (2)? +... 41. 
Because 2? — 1 > O and the second factor is also positive, 2" — 1 is 
composite. 
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Boas, R.P., 21 
Bombelli, 12, 21 
Bose and Kim, 164 
Bound, greatest lower, 10(2d) 
Bound, least upper, 8, 9, 10(2c) 
Bound, lower, 8, 10(2d) 
Bound, upper, 8 
Boundary data, 25, 51, 52, 54 
Bounded function, 88 
Bounded linear map, 86 
Bounded sequence, 21(7d), 42(2), 
46(4, 5) 
Bounded set, 15, 21(7d) 
Bruce, J.W., 163 
Brunelleschi, U1 
Bunyakowski, (C.S.B. inequality), 92 
Butzer, P.L. and Stark, E.L., 55 


C 

Cantor set, 11(7) 
Cardano, G., 12, 21 
Cartesian product, 3, 4, 13 
Cauchy, A.L., 18 
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Cauchy (C. S.B. inequality), 92 

Cauchy-Riemann equations, 18, 
20(4b) 

Cauchy sequence. 19, 20(1, 6a), 
21(7). 88, 181 


Cesaro, E., 43 
Cesaro convergent, 43, 46(1, 2, 4) 
Cesaro mean, 43, 44, 47 


Cesaro summable, 43 

Character of the additive group of 
integers, (Z. +), 137 

Character of the additive of integers 
modulo n. (Z,, +), 137 

Character of the additive group of real 
numbers, (R, +), 143(7) 

Characteristic of a field, 175 

Circle, 24 

Circle, unit (T), 25, 27 

Classical harmonic analysis, 23-55 

Closed ball, 83, 84(3b) 

Closed set, 15, 83, 84(3), 85, 90(4b) 

Closed under a binary operation, 165 

Closure of an ideal, 121 

Closure of a set, 89(2) 

Closure of a subspace, 89(2d) 

Codomain, 4 

Collection, | 

Commutative group; see Group, 
Abelian 

Commutative law, 179 

Commutative ring, 1272 

Compact set, L6 

Compact support, 107(3b), LO8 

Comparable elements, 6(3b) 

Complement, orthogonal, 109, 111, 
112(1) 

Complement, set-theoretic, 3(4), LS 

Complete, inner product space, 97, 
105 

Complete normed space, 89 

Complete orthonormal set; sce 
Maximal orthonormal set 

Completeness property of R, 9 

Completion, 104, LOS, Appendix B, 
181-186 


Complex conjugate, 14(1) 

Complex Fourier coefficients, 41, 51 

Complex Fourier series, 41, 51 

Complex numbers (C), Li-14 

Composite number, 153, 155 

Composition of functions, 6(2) 

C*(complex n-space), 79, 
C*(complex n-space ), norm of, 

82(iil) 

Congruence modulo a subgroup, 131, 
167 

Contained in, 2 

Continuous function, 16, 17, 19, 
20(2, 3b, 5), 26, 27, 28(3), 34 

Continuous linear map, 86 

Convergent sequence, 19, 20(1), 
21(7a. c), 85 

Convergent series, 33, 37(3, 4) 

Convolution, 114, 118(2-5), 126, 
128(3), 135 

C,(T), Continuous (real-valued) func- 
tions on the unit circle, 39, 41, 
42, 47. 49, 50, 51, 52 

C(T), continuous functions on the unit 
circle, 25-28, S1(remark 1), 
80, 89 

C,(), continuous functions on R 
with compact support, 107 
(3b) 

Cubic equation, 145 

Cyclic group; see Group, cyclic 


D 

D’alembert, 23, 31, 55 

Danish Academy of Sciences, 13 
Danizig, T., 21 

Debnath, L. and Mikusinski, P., 112 
Deficient number, 156 

del Ferro, 12, 2] 

De Morgan, 3(4a), 7(5) 

Dense set, 104, 105 

Derivative, 16, 17, 35, 36, 37, 42(3) 
Derivative, one-sided, 68, 69, 73, 74 
DeVito, C.L., 12 

Dickson, L.E., 164 


Differentiable, 16, 17, 20(2, 3c), 35, 
42(3), 126 


Differential equation, ordinary, 29, 30, 


31, 3), 272 

Differential equation, partial, 18, 25, 
30, 32 

Dilation, LO& 

Direction, 90 

Dirichlet kernel, 70 

Dirichlet problem (DP), 24, 25, 29, 
47, 52 

Disjoint sets, 2 

Disjoint sets, pairwise, 5 

Distance, 14 

Divergent, 19 

Divisor (factor) of a natural number, 
154 

Divisor of an integer, LSO 

Divisor of zero, 153 

Domain of a function, 5. 6(2) 

Domain of a relation, 4 


E 

Edwards, H.M., 163 

Equal sets, 2 

Equivalence class, 6(1), 103, 132, 


134(1), 167 
Equivalence relation, 4, 6(1, 3), 132, 
134(1). 167 


Equation, Laplace’s, 18, 20(4a), 24 

Equation, ordinary differential; see 
Differential equation, ordinary 

Equation, partial differential; see 
Differential equation, partial 

Equation polynomial, 12, 143 

Equations, Cauchy-Riemann; see 
Cauchy-Riemann equations 

Euclid, 156 

Euclidean norm, 81(i). 82(i1i) 

Euler, L., 13. 21, 23, 31, 55, 159(3c, 
6), 161 

Euler formulas, 39, 40, 41, 57 

Euler’s phi function, 154, 159, 164 

Euler’s totient function; see Euler’s 
phi function 
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Even function, 60, 61(2, 3), 62, 64(2) 


F 

Factor; see Divisor 

Family of sets, 5, 6, 7, 15, 16 

Fejér, L., 47 

Fejér’s kernel, 46(3). 70 

Fejér’s theorem, 47, 62, 102(1c), 106 

Fermat, P., 159, 16] 

Fermi paradox, 47, 55 

Ferarri, L., 12 

Field, 175 

Finite dimensional, 79-82. 84(4) 

Fontana, N. (Ta’taglia), 12, 21 

Fourier, J-B.J., 57, 77 

Fourier coefficients, 41, 42. 42(2, 3). 
51, 58, 61, 68 

Fourier cosine series, 62, 63 

Fourier series, 41, 47, 49, $1, 58. 61, 
70, 73 

Fourier sine series, 62, 63 

Fourier transform, 85 

Fourier transform on R, 123 

Fourier transform on Z, 113 

Fourier transform on Z,, 139 

Frederick the great, 163 

Fredholm, 99 

Function, 4, 5, 15(6), 23, 57 

Function space, L13 

Function, symmetric, 144 

Functional, linear, 87 

Functional, multiplicative, 120 

Functions, addition of, 80, L13 

Functions, multiplication of, 80, 115 

Functions, multiplication by scalars, 
80, L13 

Functions, L.P.. 103 


G 

Galois field, 176 

Gauss, K.F., 13 

Generator of a cyclic group; see 
Group, cyclic 

Geophysicists, LO8 

Gerver, J., 37, 55 
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Gibbs phenomenon, 76(2) 
Goldberg, R.R., 163 

Greatest common divisor, 150, LSI 
Greatest lower bound, 10(2d), 27 
Group, Abelian, 167 

Group, abstract, 165 

Group, additive, 128 

Group. cyclic, 131(5) 

Group, multiplicative, 129 


H 

Haar wavelet, 109 

Hardy, G.H., 37 

Harmonic analysis, 23 
Harmonic function, 24, 37(1, 2) 
Harriot, T., 12, 21 

Hausdorff maximal principle, 121 
Herstein, J.N., 127 

Hewitt, E. and Ross, K.A., 163 
Hilbert, D., 99 

Hilbert space, 97, LOS 
Homomorphism, 169 
Hungarians, 47 


I 

Ideal, proper, 121, 123(1), 173 

Ideal, maximal, 121 

Identity element, 166, 172 

Identity operator, 87 

Imaginary part of a complex number, 
15(6) 

Independent set; see Linearly inde- 
pendent set 

Inequality, Bessel’s, (strong) 97, 
(weak) 92 

Inequality, Cauchy-Schwarz- 
Bunyakowski, 92 

Inequality, triangle, 10(3a), 14(4b), 81 

Infinite dimensional spaces, 80, 84(4) 

Inner product, 79, 91 

Integers (Z), 9 

Integral equations, 99 

Integral, Lebesgue, 41, 103, 124 

Integral, Riemann, 17, 20(3b), 35, 57 

Integrable, 57 


Intersection, 2 

Invariant, 144, 145 

Inverse, additive, 7, 13, 114, 125 
Inverse element, 116, 119, 166 
Inverse function, 6(2b), 51 
Inverse, multiplicative, 8, 13 
Invertible, 116, 120 

Isomorphic, 170 

Isomorphism, 170 


J 
Jeffery, R.L., 55, 77 


K 

Katznelsen, Y., 163 

Kelley, J., 163 

Kernel, Dirichlet’s, 70 

Kernel, Fejér’s, 44, 70 

Kernel, Poisson’s, 55(1) 

Kernel (null space), Appendix A, 
90(4a). 180 

Kernel of a homomorphism, 170 

Konopinski, E., 47 

Kreyszig, E., 77 


L 

Lagrange, G., 143, 163 

Lagrange’s identity, Z1 

Lagrange’s solution to the cubic, 145 

Laplace's equation, 18, 24, 30 

Laplace’s equation in polar 
co-ordinates, 25 

Lebesgue integral; see Integral, 
Lebesgue 

Left-shift operater, 87 

Limit, 16, 20(3), 41. 42 

Linear combination, Appendix A, 107 

Linear functional, 87 

Linear mapping, Appendix A, 85 

Linear operater, 87 

Linear subspace, Appendix A, 89(2d), 
90(4a) 

Linearly independent set, 84(4), 89(3) 

1'(Z) (l-one of Z), 82(iv), 84(4), 113, 
16 


Lomont, J.S.. 177 

L.P. function, L03 

L.P. spaces, 103, LO4 

12(N) (complex L2 of NY), 95(6) 

1R(NN) (real 1-2 of N), 82(ii), 82(111), 
84(4), 93, 101 

1#(Z) (complex L2 of Z), 82 

1A(Z) (real 1-2 of Z), 82 

L'(R) (L-1 of R), 107(4), 123 

L'\(T) (L-1 of T), 104 

L2(R) (L-2 of R), 107(3c) 

L2(T) (L-2 of T), 105, 106 

Lucas-Lehmer test, LSZ 


M 

MacLaurin’s series, 50 

Matrix, 139, 141, 168, 172 

Maximal ideal; see Ideal, maximal 

Maximal Nest, 121 

Maximal orthonormal set, 98, LOL, 
102(1) 

Maximum, LZ, 24 

Mersenne numbers, 157, 163 

Meyer, Y., L12 

Minimum, 17, 24 

Modulus, 13 

Monomorphism, LZ0 

Morphorism, 169 

Multiplicity of a root, 131(3b) 

Multiresolution analysis, 108, 109 


N 

Natural numbers (NN), 8 

Neighborhood, 15, 83 

Nest, 121 

Norm, &1 

Norm, Euclidean, 81(i), 82(iii) 

Norm, from an inner product, 93 

Norm, sup, 88 

Normed space, &1 

Normalized vector, 91 

Normal subgroup; see Subgroup, nor- 
mal 

Null space (kernel). Appendix A. L80 
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O 

Odd function, 60, 61(2, 3). 62, 64(2) 

One-to-one function, 3 

Onto function, 5, 7(4) 

Open ball, 83, 84(3a) 

Open interval, 11(6) 

Open neighborhood, 15 

Open set, 15, 83. 84(3) 

Operator, left-shift, 87 

Operator, right-shift, 87 

Order of an nth root, 138 

Orthogonal complement, 109, 111, 
112()) 

Orthogonal set, 91 

Orthogonal vectors, 91 

Orthonormal basis, 101, LO6 

Orthonormal set, 91, 96(infinite). 
98(maximal) 


p 

Pairwise disjoint sets, 5 

Paradox, Fermi’s, 47, 55 

Paradox, Russell’s, 7(4c) 

Parallelogram identity, 94(b) 

Parseval's relation, 99(c and d) 

Partial differential equation; see 
Differential equation, partial 

Partial function, 4, 5. 6(2) 

Partial order relation, 6(3b) 

Perfect numbers, L564 

Periodic function, 25, 27, 28(1-7), 29 

Permutation, 145, 149(2, 3), 168 

P(f), set of periods of a function f, 25, 
26 

Piecewise continuous function, 65, 66, 
68, 69. 70(4), 23. 24 

Poisson’s integral, 55(1) 

Poisson’s kernel, 55(1) 

Pointwise convergence, 49, 70, 23 

Polar form of a complex number, 14 

Polar identity, 94(1d) 

Polynomial, 12. 50, 52(7). 65 

Polynomial, trigonometric, 49, 50, 
S1(1. 2) 

Preimage. 5, 90(4) 
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Prime numbers, 151, 154 

Primitive root of unity, 138, LSI 

Privalov, LL, 24 

Product, Cartesian; see Cartesian 
product 

Product, inner, 79, 9] 

Pythagorean relation, 94(Ic) 


Q 

Q (rational numbers), 9, 11(5) 
Quadratic equation, 144 
Quantum mechanics, 99, 112 


R 

Rational numbers (Q), 2 

Real numbers (R), Z 

Real part of a complex number, 15(6) 

Relation, 4 

Relation, equivalence; see 
Equivalence relation 

Relation, partial order; see Partial 
order relation 

Relation, total order; see Total order 
relation 

Relatively prime integers, L51 

Resolvent set, 123(2) 

Riemann, B., 36, 41, 68 

Riemann integral; see Integral, 
Riemann 

Right-shift operator, 87 

Ring, 172 

Ring, commutative, 172 

R"(real n-space), 79 

R(real n-space) norm of, 81(i) 

Roots of an equation, 12 

Roots of unity, 131(3), 138 

Russell paradox; see Paradox, 
Russell’s 

Ryan, R.D., L12 


Ss 

Scalar multiplication, 179 
Schroeder, M.R., 164 

Schwarz (C.S.B. inequality), 92 


Sectionally continuous function; see 
Piecewise continuous function 

Sequence, 18 

Sequence, bounded, 21(7d), 42(2) 

Sequence, Cauchy, 19, 20(5, 6), 21(7), 
88 

Sequence, convergent, 19, 49, 85 

Sequence, divergent, 19, 46(1) 

Sequence, of functions, 32, 33, 39, 49 

Series, 23, 33 

Series, convergent, 33, 39 

Series, Fourier, 41, 58, 61, 70 

Series, of functions, 33, 39 

Series, of numbers, 43, 46(5) 

Series, power, 37(3, 4, 5) 

Series, summable, 43 

Set, 1, 2,3 

SETI, 47 

Signal processing, LO8 

Silard, L., 47 

Simmons, G.F., 163 

Singular element, 120 

Space, Banach, 88, 89 

Space, Hilbert, 97 

Space, inner product, 91 

Space, normed, 81 

Space, of functions, 80 

Space, vector, Appendix A, 179 

Span, linear, Appendix A, 180 

Spectrum, 123(2) 

Subgroup, 129, 166 

Subgroup, normal, 171, 172(4) 

Subset, 2 

Subspace, Appendix A, 180 

Sums, infinite, 31 

Superposition, LO7 (see also linear 
combination) 

Support, 107(3b) 

Symmetric function; see Function, 
symmetric 


T 
Ta’taglia (N. Fontana), 12, 21 
Tauberian theorems, 46(4) 


Teller, E., 47 

Total order relation, 6(3b), 8 

Totient function; see Euler’s totient 
function 


Transform, Fourier; see Fourier trans- 


form 
Translation, 108 


Triangle inequality, 10(3a), 14(4b), 81 


Trigonometric polynomial; see 
Polynomial, trigonometric 
Trigonometric series, 23 


U 

U,, 131(3), 138 

Uniformly continuous function, 17, 
20(6), 28(7), 65, 69, 70(4) 

Uniformly convergent sequence, 32, 
33, 36, 37, 37(1), 39, 47, 
S11, 9) 

Uniformly covergent series, 33, 39 

Union, 2, 5 

Unit circle (T), 27 

Unit, ( ring with unit), 172 


V 
Vandermonde, 163 
Vector, Appendix A, 179 
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Vector, normalized, 91 

Vector, unit, 91 

Vector space, Appendix A, 179 
Vector space of functions, 80 
Vieta, 144, 146 


WwW 

Wavelet, LO8 

Webb, S., 55 

Weierstrass approximation theorem, 
35, 65, 66 

Weierstrass, K., 36 

Weierstrass M-test, 33, 36, 37 

Wessel, 13, 21 

Wiener, N., 115, 123, 163 

Witmer, T.R., 21 

Wojtaszczyk, P., 112 


Y 
York, H., 47 


Z 

Z (the integers), 9 

Zero, proper divisor of, 153 
Zero, vector, 179 

Z, (the integers modulo n), 133 


A Gentle Introduction. 


Many branches of mathematics come together in harmonic analysis; each adds richness to the subject and 
provides insights into this fascinating field. DeVito's Harmonic Analysis: A Gentle introduction presents a clear, 
comprehensive introduction to Fourier analysis and Harmonic analysis, and provides numerous examples and 
models, leaving students with a clear understanding of the theory. 


Key Features of Harmonic Analysis: A Gentle Introduction: 


* Material is presented in a unified way, avoiding the “cookbook” character of many texts. For example, in Chapter 
one an interesting boundary value problem is solved. The discussion then leads naturally to harmonic functions, 
periodic functions, and Fourier series. The various ways such series can converge are investigated in three 
separate Chapters. Roots of unity and the fast Fourier transform are discussed. Here the symbiotic interplay 
between algebra and analysis is emphasized and exploited. 


* Students are brought into the discussion by providing them with problems keyed to the material being 
discussed. Such problems are marked with a star. Solutions to starred exercises throughout the text are 
provided in an appendix. 


* The structure and format of the text is flexible, allowing the instructor to tailor it to the needs of the course and 


the needs of the students. There is a discussion of the number theory that arises in connection with the Fourier 
transform and a chapter on the relevant structures from abstract algebra. 
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